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SYNDECAN STIMULATION OF CELLULAR DIFFERENTIATION 



fl^nnFTHF INVENTION 

This Invention is in the field of cancer biology and therapy. Specifically, 
the invention is to a method for altering the differentiated state of a cell by 
altering syndecan expression. The method allows for the normalization of the 
growth rate and differentiation state of malignant cells, such method being 
based on the stimulation of syndecan expression in malignant cancer cells. Re- 
expression of syndecan in such malignant cells promotes their normal 
differentiated phenotype and prevents their tumor formation. This method may 
also be applied to normal cells to increase their differentiation, and therefore 
support the maintenance of cells, e.g. keratin production to prevent baldness. 

R AGROUND OF THF INVENTION 

It is becoming more evident that cell surface proteoglycans play an 
important role in the regulation of cell behavior (Ruoslahti etal., Cell 64:867- 
869 (1991)). Through their covalently bound glycosaminoglycan side chains, 
such proteoglycans can bind various extracellular effector molecules (Jalkanen, 
etal., in Receptors for Extracellular Matrix, J. MacDonald & R. Mecham, Editors, 
Academic Press, San Diego, pp. 1-37 (1991)). One central challenge in 
proteoglycan biology is the need to understand what directive a cell receives by 
binding different effector molecules via the cell surface proteoglycans. It is 
further imperative to investigate what kind of intracellular response such 
binding activates, thereby leading to altered behavior of the cell. The way to 
approach these questions is to create biological models which are dependent 
on the expression of any given proteoglycans. 

Syndecan is the best characterized cell surface proteoglycan (Saunders 
etal., J. Cell Biol. 705:1547-1556 (1989); Mali etal., J. Biol. Chem. 265:6884- 
6889 (1990)). It was originally isolated from mouse mammary epithelial 
(NMuMG) cells as a hybrid proteoglycan containing both heparin sulfate and 
chondroitin sulfate glycosaminoglycan side chains (Rapraeger et al., J. Biol. 
Chem. 260-A 1 046-1 1 052 (1 985)). Recent studies have revealed its expression, 
not only on epithelial cells but also on differentiating fibroblasts of developing 
tooth (Thesleff etal., Dev. Biol. 729:565-572 (1988); Vainio etal., J. Cell Biol. 
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105:1945-1964 (1989)), on endothelial cells of sprouting capillaries (Elenius 
et al., J. Cell Biol. 1 74:585-596 (1 991 )) and on the surface of lymphocyte 
subpopulations (Sanderson etal., Cell Regul. 7:27-35 (1989)) Intimating that 
Its function can vary on the surfaces of different cells. Syndecan belongs to a 
family of proteoglycans with conserved plasma membrane and cytoplasmic 
domains but with dissimilar ectodomains (Mali etal., J. Biol. Chem. 265:6884- 
6889 (1990)). The conserved structure of syndecan suggests that it could 
participate in signal transduction through the plasma membrane. 

Syndecan binds several extracellular effector molecules but this binding 
is selective. For example, syndecan binds interstitial collagens and fibronectin 
but does not bind to vitronectin or laminin (Koda et al., J. Biol. Chem. 
250:8156-8162 (1985)); Saunders etal., J. Cell Biol. 706:423-430 (1988); 
Elenius etal., J. Biol. Chem. 265:17837-17843 (1990)). Moreover, syndecan 
isolated from tooth mesenchyme has revealed selective binding to tenascin not 
observed for syndecan from NMuMG cells (Salmfvirta etal., J. Biol. Chem. 
265:7733-7739 (1991)). This suggests variation in syndecan glycosylation that 
results in the selective binding properties for syndecan. Polymorphism of 
syndecan glycosylation has also been observed between simple and stratified 
epithelia (Sanderson etal., Proc. Natl. Acad. Sci. USA 55:9562-9566 (1988)); 
but whether these changes also reflect altered ligand recognition by syndecan 
remains unknown. Syndecan also binds growth factors, such as basic 
fibroblast growth factor (bFGF) (Kiefer etal., Proc. Natl. Acad. Sci. USA 
57:6985-6989 (1990); Elenius etal., J. Biol. Chem. 267:6435-6441 (1992)). 
Very recently, Yayon and coworkers (Yayon etal., Cell 54:841-848 (1991)) and 
Rapraeger and co-workers (Rapraeger etal., Science 252:1705-1708 (1991)) 
have shown that heparin-like molecules are required for the binding of bFGF to 
its high affinity receptor, indicating that syndecan-like molecules can also 
modulate the growth factor response. The fact that cell surface proteoglycans 
can bind both growth factors and matrix components could theoretically imply a 
role in regulating, both temporally (timing of expression) and spatially (precise 
localization), growth promotion by immobilizing these effector molecules to the 
vicinity of cell-matrix interactions. This is supported by the intriguing pattern of 
syndecan expression in the development that follows morphogenetic rather 
than histological patterns (Thesleff etal., Dev. Biol. 720:565-572 (1988); Vainio 
etal., J. Cell Biol. 705:1945-1954 (1989) and Vainio etal., Dev. Biol. 734:382- 
391 (1989)), and moreover, that syndecan expression is localized to sites of 
active proliferation (Elenius etal., J. Cell Biol. 774:585-596 (1991) and Vainio 
etal., Dev. Biol. 747:322-333 (1991)). 
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In simple epithelium, syndecan is polarized to baso-lateral surfaces 
where it co-localizes with actin rich cytofilaments (Rapraeger eta!., J. Cell Biol. 
705:3683-2696 (1986)). Upon rounding, syndecan is shed from the cell surface 
by proteolytic cleavage of the core protein at the cell surface, a process which 
separates the matrix binding ectodomain from the membrane domain (Jalkanen 
etal., J. Cell Biol. 1 05:3087-3096 (1987)). By this way, syndecan has been 
proposed to be involved In the maintenance of epithelial morphology. Mouse 
mammary tumor cells (S1 15), when steroid-induced to modulate their 
morphology from an epithelial to a more fibroblastic or fusiform phenotype, lose 
syndecan expression (Leppa etal., Cell Regul. 2:1-11 (1991)), as to several 
other cell types in vivo when they transform (Inki etal., Am. J. Pathol. 
739:1333-1340 (1991); Inki etal., Lab. Invest. 66:314-323 (1992)), suggesting 
a general loss of syndecan expression during malignant transformation. 

ftl 1MMARY OF THE INVENTION 

It is therefore an object of the subject invention to provide a method of 
altering the differentiated state of a host cell by altering syndecan expression in 
such cell. 

It is a further object of the invention to provide a method to induce and 
regulate syndecan expression, especially in cells which exhibit a malignant 
phenotype, regardless of the origin of transformation. 

It is a further object of the invention to provide a treatment for the 
reduction of tumor growth in a patient in need of such treatment, by 
administration of a composition to such patient, such composition comprising 
efficacious amounts of one or more agents that stimulate syndecan synthesis in 
the tumor cells of such patient. 

It is a further object of the Invention to provide the DNA sequence and 
localization of promoter, suppressor and enhancer elements for the syndecan 
gene. 

It is a further object of the invention to provide a method for the 
enhancement of syndecan expression in a host cell, by enhancing syndecan 
gene transcription. 
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It is a further object of the invention to provide a method for the 
enhancement of syndecan expression in malignant cells, by preventing 
suppression of syndecan gene transcription.- 

It is a further object of the invention to provide a biochemical method for 
the inactivatiori of suppressors of syndecan gene expression in malignant cells. 

It is a further object of the invention to provide a method for the 
stimulation of cellular differentiation by enhancing syndecan expression in both 
malignant and normal cells. 

It is a further object of the invention to provide a method for the 
stimulation of cellular proliferation and differentiation, thus tissue regeneration, 
especially in processes such as wound healing, by enhancing syndecan 
expression. 

Further features, objects and advantages of the present invention will 
become more fully apparent from a detailed consideration of the following 
description of the subject invention when taken together with the accompanying 
drawings. 

RRIFF nFSHRIPTinN OF THE DRAWINGS 

Figure 1 . Figure 1 is a diagram of the assembly of mouse syndecan 
gene and its promoter region. 

Figure 2. Figure 2 is the complete sequence of the mouse syndecan 
gene [SEQ ID Nos. :1 : (DNA) and 2: (protein)] and its proximal promoter. 
Regulatory sites for the expression of syndecan may also exist on the first intron 
following the first exon (see Figure 1 ). 

Figure 3. Figure 3 is a diagram of the assembly of mouse syndecan 
promoter region and the localization of the enhancer and suppressor elements 
together with restrictions sites for three different enzymes. 

Figure 4. Figure 4 is the complete sequence of the mouse syndecan 
enhancer element [SEQ ID No. :3: (DNA)].located 8-1 0 kbs upstream from the 
transcription initiation site as indicated in figure3. 

Figure 5. Figure 5A (panels A-D) is a photographic presentation of 
reduced growth ability of syndecan-transfected cells in soft agar. Panel A (wild- 
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type S1 15 cells) and B (control transfected cells) are pictures of the colonies 
that are formed in soft agar In the presence of testosterone, a feature typical for 
hormone-transformed cells. This growth ability was not observed with two 
independent syndecan-transfected cell clones (panels C and D), demonstrating 
how syndecan re-expression can overcome the effect of hormone-induced 
transformation. Figure 5B is a graphical presentation of how syndecan- 
transfected cells lose their ability to form tumors in nude mice. Wild-type or 
control transfected cells produce tumors in testosterone-administered nude 
mice while syndecan transfected cells revealed a very low tendency to produce 
tumors. 

Figure 6 is a graphical representation of enhanced syndecan expression 
in 3T3 cells by simultaneously administered basic fibroblast growth factor 
(bFGF) and transforming growth factor beta (TQF-6). This is an example of how 
syndecan expression can be enhanced as a result of growth factor action in 
normal cells during the differentiation process. 

Figure 7 is a graphical representation of enhanced syndecan expression 
by MCF-7 cells exposed to the anti-estrogen toremifene. When exposed to 
estrogen, syndecan expression in MCF-7 ceils was reduced and the cells 
transformed. Subsequent treatment with the anti-estrogen (toremifene) 
restored syndecan expression to levels close to that found in cells not exposed 
to estrogen and aided the cells in maintaining their normal growth behavior. 

Figure 8 is a graphical presentation of how the suppressor element (see 
figure 3) is acitve in S1 15 cells treated with testosterone. Indicated stretches of 
promoter sequences were transfected in hormone-treated S115 cells and 
analysed for their transcription activity as described in Example VI. A dramatic 
drop was observed wiht suppressor construct as indicated in Figure 3, and it 
was more obvious in transformed S115 (a) cells than in 3T3 cells (b). 

Figure 9 is a graphical presentation of how the enhancer element is 
active in growth hormone-treated 3T3 cells. Various stretches of promoter were 
transfected in 3T3 cells and analysed for their transcription activities. Fragment 
pXb6, which is the same as illustrated in Figure 3 as an enhancer, revealed 
more than ten fold stimulation in 3T3 cells exposed to growth factors bFGF and 
TGFB if compared to non-treated cells. 
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DEFINITIONS 

In order to provide a clearer and more consistent understanding of the 
specification and claims, including the scope to be given such terms, the 
following definitions are provided. 

"Enhancement" nr "Stimulation" of Syndecan expression. By 
"enhancement" or "stimulation" of syndecan expression" is meant an effect of 
increasing the synthesis of syndecan, either by the Induction or de-suppression 
(de-repression) of syndecan gene transcription and/or translation. 

Cell growth . By "cell growth" is meant cell replication, both controlled and 
uncontrolled. 

Malignant . By "malignant" is meant uncontrolled cell growth. 

More Differentiated Phenotvoe . In stating that a cell has a "more 
differentiated phenotype" is meant that the cell possesses a phenotype usually 
possessed by a certain cell type more differentiated than the cell, which the cell 
was deficient in prior to enhancement of syndecan expression according to the 
invention. This phenotype may be defined by one or more phenotypic 
characteristics. For example, an epithelial cell is a more differentiated 
phenotype of a mesenchymal-like shape; therefore, the ability of the method of 
the invention to maintain cells in an epithelial cell morphology rather than a 
mesenchymal-like shape is a more differentiated phenotype within the meaning 
of the definition. Continuous syndecan expression is necessary for the 
maintenance of terminal differentiation of epithelial cells. 

Syndecan expression is also linked to the normal differentiation of 
mesenchymal cells. However, only a transient increase in syndecan expression 
is needed for normal differentiation in mesenchymal cells. Therefore, contrary to 
epithelial cells, expression of syndecan Is not needed for maintenance of 
terminal differentiation in mesenchymal cell. To induce differentiation of a 
suitable mesenchymal precursor cell population (such as a "condensing 
mesenchymal" cell population) to a fully differentiated mesenchymal cell, there 
is needed only a transient expression of syndecan expression. Therefore, a 
terminally differentiated mesenchymal cell is a "more differentiated phenotype" 
than a condensing mesenchymal cell. 
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Other useful phenotypes that are present in syndecan-deficient cells and 
not in cells expressing sufficient syndecan include fusiform shapes with long 
filopodial extensions with extensive under-and overlapping of these processes 
(so that the cells appear to have a defect in cell adhesion). 

in another example, syndecan-deficient NMuMG cells continue to secrete 
milk fat globule antigen (and thus appear mammary-like) and continue to 
express cytokeratins (thus appear epithelial-like). However, their actin- 
containing cytoskeleton is disorganized and their expression of betai integrins 

and E-cadherins at the cell surface is markedly reduced. Upon expression of 
sufficient syndecan, these phenotypes are corrected so that there is no 
reduction in cell surface integrins or E-cadherin and the cell has an epithelial 
morphology. Therefore, the amount of cell surface integrins or E-cadherin are 
useful markers of syndecan expression and may be used to monitor what : 
amount of a drug is needed for efficacious results according to the method of 
the invention. 

Efficacious Amount . An "efficacious amount" of an agent is an 
amount of such agent that is sufficient to bring about a desired result, especially 
upon administration to an animal or human. 

Administration. The term "administration" is meant to include introduction 
of agents that induce syndecan expression into an animal or human by any 
appropriate means known to the medical art, including, but not limited to, 
injection, oral, enteral and parenteral (e.g., intravenous) administration. 

Pharmaceutical^ Acceptable Salt. The term "pharmaceutically 
acceptable salt" is intended to include salts of the syndecan-inducing agents of 
the invention. Such salts can be formed from pharmaceutically acceptable 
acids or bases, such as, for example, acids such as sulfuric, hydrochloric, nitric, 
phosphoric, etc., or bases such as alkali or alkaline earth metal hydroxides, 
ammonium hydroxides, alkyl ammonium hydroxides, etc. 

Pharmaceutica lly Acceptable Vehicle. The term "pharmaceutically 
acceptable vehicle" is intended to include solvents, carriers, diluents, and the 
like, which are utilized as additives to preparations of the syndecan-inducing 
agents of the invention so as to provide a carrier or adjuvant for the 
administration of such compounds. 



Treatment. The term "treatment" or "treating" is intended to include the 
administration of compositions comprising efficacious amounts of syndecan- 



WO 94/12162 




VFI93/00514 



inducing agents to a subject for purposes which may include prophylaxis, 
amelioration, prevention or cure of a medical disorder, including tumor growth, 
hair growth. ~ 

HPTAII FD nFfiftRIPTION OFTHF PREFERRED EMBODIMENTS 

Briefly, in its broader aspects the present invention comprehends a 
method for maintaining a differentiated phenotype in a normal (non-malignant) 
cell that otherwise wduld suppress syndecan expression, by maintaining 
syndecan expression in such cell. The invention also comprehends a method 
for inducing a more differentiated phenotype in a malignant cell that lacks (or is 
deficient In) syndecan expression, by stimulating syndecan expression in such 
a cell. As used herein, if a cell is said to "lack" syndecan expression, it is meant 
to include cells that are either completely deficient in syndecan protein or 
produce insufficient syndecan levels to maintain or attain a desired 
differentiated phenotype. 

The methods of the invention will not only prevent the progression 
(worsening) of a transformation state and tumor growth of cells, but can also be 
used to maintain differentiated cells in their differentiated state so that they 
continue to perform differentiated functions characteristic of such cells. 
Examples of differentiated functions of non-malignant cells include the 
secretion characteristics of a cell (that is, the secretion of specific proteins 
and/or other macromolecules) and hair formation by epidermal cells of skin. 
Thus, according to the invention, administration of agents capable of inducing 
syndecan expression in epidermal skin cells of the scalp will promote hair 
growth among bald (or balding) people. 

The subject method may be accomplished in a variety of manners 
including biochemical, chemical or even molecular biological type methods. 
While the method is applicable to a variety of cancer (both malignant and non- 
malignant) and normal cells, It is particularly adaptable for treating malignant 
cells which have become transformed. This includes cells transformed due to 
hormonal influences of the body or environmental influences, such as 
chemicals or radiation exposure. Especially, the tumor type is a tumor 
characterized by loss of syndecan, for example, a glioma, myeloma, carcinoma, 
sarcoma, lymphoma, or adenoma. 

Generally, any cell genetically capable of expressing syndecan can be a 
cell stimulated to express syndecan by the method of the invention. Syndecan - 
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is naturally expressed In a wide variety of epithelial cells in mature and 
embryonic tissues and by various embryonic mesenchymal tissues undergoing 
inductive interactions with eplthelia. In addition, syndecan is naturally 
expressed on Leydig cells and on developing B-lymphocytes and a 
subpopulation of plasma cells. 

Enhanced syndecan expression may be achieved by administration of 
compositions containing a biochemically, and/or chemically and/or molecular- 
biologically active component to an individual. Compositions may be 
administered orally, intravenously, subcutaneously or locally, or by any other 
method which will allow cells, normal or malignant, to be exposed to syndecan 
expression enhancing component. 

By a "biochemically" or "chemically" active component is meant a 
component that alters the endogenous syndecan biochemistry or chemistry of 
the target cell without altering syndecan gene expression perse. For example, 
such alterations may include altering the half-life of syndecan protein or mRNA, 
so as to increase levels of syndecan protein in the cell, for example, by altering 
the external domain of the cell's endogenous syndecan, or the cell surface 
membrane properties in general, so as to retain higher levels of syndecan on 
the cell surface; and, altering the syndecan protein active site(s), so as to 
enhance the efficiency of the syndecan response. 

By a "molecular-biologically" active component is meant a component 
that alters endogenous syndecan gene expression in a manner that allows for 
an increase in cellular syndecan, such as, for example, by stimulating 
transcription, preventing (or reducing) suppression of transcription, de- 
repression transcription, and generally Increasing levels of mRNA and/or 
translation efficiency. 

It is known that cellular transformation involves activation of some cell 
growth stimulating genes (like oncogenes) and inactivation of some other 
genes, which work to suppress cell growth. It has recently been shown that loss 
of syndecan expression is observed upon transformation of cells, and that this 
suppression is due to syndecan gene inactivation (Leppa et a/., Cell Regul. 2:1- 
11 (1991); Inki era/., Am. J. Pathol. 139:1333-1340 (1991); Inki era/., Lab. 
Invest. 66:314-323 (1992)). This was demonstrated in several biological 
models of various known carcinogenic systems (oncogenes, chemical 
carcinogens, UV-light, hormone-exposure, etc.). Therefore, syndecan gene 
suppression is implicated in the development of cellular transformation. 
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Further, according to the invention, ail the manipulations of such cells 
which can induce syndecan expression in malignant cells, also induce these 
cells to obtain a more differentiated phenotype, and thus, subsequently reduce 
their potential tumorigenic behavior and metastasis. 

In a preferred embodiment, the cell in which syndecan expression is 
stimulated is a cell that is steroid-responsive. Examples of such steroid- 
responsive cells include breast cells, endometrium cells and prostate cells, 
especially in the malignant state. In a highly preferred embodiment, the cell is 
responsive to estrogen and/or androgen. 

Examples of other ceil types that will respond to the treatment of the 
invention include malignant and non-malignant mesenchymal cells. 

The regulatory elements of a given gene are commonly located upstream 
from (5' to) the transcription initiation site. Syndecan, however, has a very 
peculiar gene structure, in which the first and second exons are separated by a 
very large Intron (Figure 1). This could mean that, in addition to the base 
sequences upstream from the transcription site, syndecan expression may also 
be susceptible to regulation by base sequences located between first and 
second exons (Figure 2), that is, in the first intron. 

It has now surprisingly been found that the syndecan gene has a strong 
enhancer element located 8-10 kb upstream from the transcription initiation site. 
The sequence of this enhancer element has been identified and is given in 
Figure 4 [SEQ ID. No. 3]. 

Manipulation of the upstream region of the syndecan gene can block its 
inactivation during malignant transformation. For example, replacement of the 
region in front of first exon of the syndecan gene with the glucocorticoid- 
inducible elements of mouse mammary tumor virus (MMTV) not only blocks 
syndecan suppression during malignant transformation, but also inhibits the 
ability or potential of cells to transform and become tumorigenic (Figure 3). 
These findings suggest a very important role for syndecan in the maintenance 
of normal epithelial morphology (Leppa etai., Proc. Natl. Acad. Sci. USA 
53:932-936(1992)). 

Cells destined to differentiate during organ formation or tissue 
regeneration also exhibit enhanced syndecan expression (Vainio etai, Dev. 
Biol. 747:322-333 (1991); Elenius etai., J. Cell Biol. 114:585-595 (1991)). The 
component(s) responsible for the regulation of syndecan expression (either 
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directly or indirectly) have not yet been identified. Growth factors are 
candidates for this role since they are known to be involved in the regulation of 
early development and cellular differentiation (Heath etal., Curr. Opin. Cell 
Biol. £935-938 (1991)). Involvement of growth factors is also supported 
indirectly by the fact that the expression of two embryonally important growth 
factors (TGF-B and FGF) has been shown to coincide with syndecan within 
developing tooth (Vaahtokari etal., Development 773:985-994 (1991); 
Wilkinson etal., Development 705:131-136 (1989)). 

Based on these findings the possible effect of growth factors on the 
expression of syndecan has been tested. It was shown that both bFGF and 
TGF-B alone, but especially if administered in combination, enhance syndecan 
expression by 3T3 cells (Figure 4). This stimulation is close to the levels 
observed in syndecan-expressing epithelial cells (Eienius etal., J. Biol. Chem. 
257:6435-6441 (1992)) prior to their malignization (Lappa etal., Proc. Natl. 
Acad. Sci. USA 59:932-936 (1992)). These findings suggest that growth 
factors, and their derived fragments and domains may prove to be valuable for 
the development of an active tool for the regulation of syndecan expression. 

Preferably, for treatment of humans and animals, a drug is administered 
that results in the enhancement of syndecan expression to levels sufficient to 
facilitate cellular differentiation in the degenerative stages of tissues. Such 
drugs are herein termed "syndecan-inducing agents." Syndecan-inducing 
agents include the growth factors and their derivatives that retain growth-factor 
activity. Examples of such growth factors include bFGF, and TGF-B, whether 
administered separately or together. 

Even more preferred is a syndecan-inducing agent that has good tissue 
and cell penetration so that it could directly interfere with suppressors) of 
syndecan expression within cell nuclei. Such a syndecan-inducing agent is the 
known antitumor drug toremifene. When toremifene, known to have good 
plasma membrane penetration, is administered to the hormone-transformed 
epithelial cells with reduced syndecan expression, the cells reverse their 
lowered syndecan expression, and induce syndecan levels to those close to 
the level observed with normal, non-transformed cells (Figure 5). This 
demonstrates that syndecan-inducing agents useful in the methods of the 
invention are known and available and that such agents can specifically 
prevent cellular malignization by blocking suppression of syndecan expression. 
Another useful drug in this regard is tamoxifen. 
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Such syndecan-inducing agents may be administered using currently 
available preparations, or in any pharmaceutically acceptable vehicle. The 
route of administration may be any route that delivers efficacious levels of the 
drug to the desired active site, for example, by injection. 

For parenteral administration, preparations containing one or more 
syndecan-inducing agents may be provided to the patient in need of such 
treatment in combination with pharmaceutically acceptable sterile aqueous or 
non-aqueous solvents, suspensions and emulsions. Examples of non-aqueous 
solvents are propylene glycol, polyethylene glycol, vegetable oil, fish oil, and 
injectable organic esters. Aqueous carriers include water, water-alcohol 
solutions, emulsions or suspensions, including saline and buffered medical 
parenteral vehicles including sodium chloride solution, Ringer's dextrose 
solution, dextrose plus sodium chloride solution, Ringer's solution containing 
lactose, or fixed oils. Intravenous vehicles include fluid and nutrient 
replenishers, electrolyte replenishers, such as those based upon Ringer's 
dextrose and the like. 

The syndecan-inducing agent of the invention can be employed in 
dosage forms such as tablets, capsules, powder packets, or liquid solutions for 
oral administration If the biological activity of the syndecan-inducing agent is 
not destroyed by the digestive process and if the characteristics of the 
compound allow it to be absorbed across the intestinal tissue. 

The syndecan-inducing agents may also be administered by means of 
pumps, or in sustained-release form. The syndecan-inducing agents used in 
the method of invention may also be delivered to specific organs in high 
concentration by means of suitably inserted catheters, or by providing such 
molecules as a part of a chimeric molecule (or complex) which is designed to 
target specific organs. 

Administration in a sustained-release form is more convenient for the 
patient when repeated injections for prolonged periods of time are indicated. 

The composition containing the syndecan-inducing agent can be 
manufactured in a manner which is in itself know, for example, by means of 
conventional mixing, granulating, dragee-making, dissolving, lyophilizing or 
similar processes. The compositions of the present invention that provide the 
syndecan-inducing agent, in and of themselves, find utility in their ability to slow 
or prevent tumor growth or tumor reappearance. The syndecan-inducing 
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compositions of the invention utilize the body's own mechanisms for promoting 
differentiation of specific cell types to its maximum potential. 

In intravenous dosage form, the compositions of the present invention 
have a sufficiently rapid onset of action to be useful in the acute management of 
tumor growth. 

Additionally, a low potency version is useful in the management of 
disorders wherein a tumor has been effectively treated and the patient appears 
to be In remission, but it is desired to maintain sufficient levels of syndecan- 
inducing agents in the patient so as to assist the body in preventing a 
recurrence of the tumor. 

Typical doses of toremifene or tamoxifen, and other such syndecan- 
inducing agents useful in the methods of the invention for treatment of humans 
or other artimals are 20-600 mg daily, and preferably 20-60 mg dally. 

The examples below are for illustrative purposes only and are not 
deemed to limit the scope of the invention. 

EXAMPLE I 

Raversal of h ormone-inducftri transformation bv exogenous svndecan 

expression. 

As previously described (Leppa eta!., Cell Regulation 21-11 (1991)), 
S1 15 mouse mammary tumor cells were routinely cultured in DMEM. For 
experimental studies involving hormone treatment, inactivated fetal calf serum 
(i-FCS) was replaced with 4% dextran charcoal-treated fetal calf serum (DCC- 
FCS), which eliminates endogenous steroids from serum), and used either with 
or without testosterone (10 nM) and with or without dexamethasone (10 nM or 1 
uM). Cells were plated at a density of 10,000 cells/cm 2 and the medium was 
replenished every 3 days. 

Plasmid pUC19-hsynpr7 containing human syndecan cDNA (Mali etai, 
J. Biol. Chem. 265:6884-6889 (1990)) was digested with Nae\ restriction 
endonuclease, and the derived 336 bp long-fragment was separated in and 
eluted from low melting agarose gel. Plasmid pUC19-hsyn4 (Mali etai, J. Biol. 
Chem. 265:6884-6889 (1990)) was digested with Nae\ and Hind\ (polylinker 
site), and the plasmid-containing fragment starting from base 487 was isolated. 
The Nael fragment from hsynpr7 was ligated to the pUC-hsyn NaeVHinM 
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digested vector. The orientation of insert was verified by restriction enzyme 
analysis and sequencing. The derived piasmid, containing the full coding 
region of human syndecan core protein, was named pUC19-hsynfull. This 
piasmid was further digested with BamHI and Sph\ (polylinker site). A fragment 
containing syndecan coding region bases 150-1461 was isolated and blunt- 
ended, using Klenow and T4 DNA polymerase. Finally, this fragment was 
ligated to Sa/l-llnearized and blunt-ended pMAMneo vector (Clontech; Palo 
Alto, CA), resulting in a chimeric gene containing RSV-MMTV-LTR promoter 
connected to human syndecan coding region and SV-40 poiyadenylation 
signal. The orientation was confirmed by restriction enzyme digestions. The 
piasmid was named pMAMneo-hsyn. 

For control transections, a 642 bp iong-H/ndlll/Pvuil fragment of human 
growth hormone gene (consisting of exons 4 and 5; Bornstein etal., J. Biol. 
Chem. 263:1 603-1 606 (1 988)) was blunt-ended and cloned into the same 
pMAMneo vector, as described above. This control construct was named 
pMAMneo-hGH. 

All plasmids were isolated using the CsCI density gradient method. 
Before transfections, plasmids were linearized with MM, chloroform/phenol 
extracted and ethanol precipitated. 

Transfections were performed using Lipofectin™ (BRL), according to 
manufacturer's instructions. After selection for two weeks (G418; 750 ng/ml, 
Sigma), surviving clones were isolated from growth plates using cloning 
cylinders. The expression of human syndecan or growth hormone (consisting 
of exons 4 and 5) mRNAs was then confirmed by RNA isolation and Northern 
blot analysis. Clones expressing high levels of transfected genes were selected 
for further studies and characterizations. These stock cells were routinely 
cultured in the presence of G418 (300 jig/ml). 

For the measurement of exogenous syndecan expression total RNA was 
isolated from wild-type S1 15 cells and cells transfected with human syndecan 
or growth hormone genes. RNA was extracted using 4M guanidine 
isothiocyanate and CsCI pelleting, as earlier described by Chirgwin et ai, 
Biochemistry 75:5294-5299 (1979)). RNA from normal mouse mammary 
NMuMG and normal human mammary HBL-100 cells was used for comparison. 
RNA aliquots of 15 ng were separated in 1% formaldehyde agarose-gel 
electrophoresis and transferred to GeneScreen Elus™ hybridization membrane 
(New England Nuclear). Blots were hybridized with multiprime (Amersham) 
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labeled Inserts of either mouse (PM-4) (Saunders etal., J. Cell Biol. 70&1547- 
1556 (1989)) or human syndecan (pUC19-hsyn4 fiamH1 1.1 kb fragment) (Mall 
etal., J. Biol.Chem. 265:6884-6889 (1990)), or with human growth hormone 
exons 4 and 5 (hGH) (Leppa etal., Proc. Natl. Acad. ScL USA $9:932-936 
(1992)) cDNAs, using the high stringency conditions suggested by the 
manufacturer of the membrane (New England Nuclear). All techniques based 
on modern molecular biology are fully explained in the literature such as in the 
laboratory manual entitled Current Protocols in Molecular Biology. 

Anchorage Independent cell growth was measured in a soft agar colony 
assay. The six well-plates were first covered with an agar layer consisting of 2 
ml DMEM, 0.5% agar and 4% DCC-FCS. The middle layer contained 10 4 cells 
in 0.5 ml DMEM supplemented with 0.33% agar and 4% DCC-FCS, with or 
without 10 nM testosterone. The uppermost layer, consisting of medium (2 ml), 
was added to prevent drying of the agarose gels. The plates were incubated at 
37*C in 5% CO2 for 12 days after which cultures were evaluated and 

photographed. 

Tumorigenicity of S1 15 wild type cells, one hGH transfected control clone 
and two clones expressing human syndecan-1 was measured In nude mice. 
For this cells were cultured In DMEM containing 5% FCS and 10 nM 
testosterone. After four days in culture, cells were harvested with trypsin, 
washed, and 10 7 cells suspended in 0.2 ml of DMEM were injected 
subcutaneously into back of athymic male nude mice (balb-C). Silastic 
testosterone capsule, which is known to increase the growth rate of S115 cells 
(King etal., J. Steroid. Biochem. 7:869-873 (1976)) was simultaneously 
implanted. Nude mice were examined regularly for tumor development and the 
size of the palpable tumors measured at intervals. 

EXAMPLE II 

firowth facto rs anhance svnriecan expression. 

NMuMG mouse mammary epithelial cells and 3T3 (NIH) mouse 
fibroblasts were routinely cultured in bicarbonate-buffered Dulbecco's modified 
Eagle's medium (DMEM; GIBCO) containing 10% FCS (GIBCO) and antibiotics, 
as previously described (Elenius etal., J. Biol, Chem. 265:17837-17843 
(1990)). For experiments, cells were plated at equal density on culture dishes 
(Nunc) and grown to 60 - 70% confluency. Twenty-four hours before 
supplementing growth factor(s) to the medium, fresh medium containing 2% 
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CMS-FCS (Vogel ©fa/., Proc. Natl. Acad. Sci. USA 75:2810-2814 (1978)) was 
replaced. Equally treated cultures without growth factor addition served as 
negative controls. Porcine TGF61 (R&D), recombinant human bFGF 
(Boehringer) and murine EGF (Sigma) were used in final concentrations of 2.5 
ng/ml (100 pM), 10 ng/ml (570 pM) and 1.2 ng/ml (200 pM) respectively, in all 
experiments. For quantitation and isolation of cell surface syndecan, media 
were discarded at time points indicated in the text and the cell layers were 
washed twice with ice cold phosphate buffered saline (PBS). Cells were 
scraped with a rubber policeman into ice cold PBS supplemented with 0.5 mM 
EDTA and centrifuged. After subsequent washes by resuspension and 
centrifugation the cell numbers were measured by counting the nuclei with a 
Coulter Counter (Coulter Electronics). 

For quantitation of syndecan intercalated into the cell membrane, 
syndecan ectodomain was released by incubating washed cells in 20 ng/ml 
bovine pancreatic trypsin (Type III; Sigma) in PBS for 10 min on Ice bath. After 
incubation the cells were centrifuged, leaving the ectodomain in the 
supernatant (Rapraeger et al., J. Biol. Chem. 260:11046-11052 (1985)). 
Sample volumes equal to 400,000 or 200,000 cells for 3T3 or NMuMG cells, 
respectively, were loaded on cationic nylon membrane (Zeta-Probe; BioRad) In 
a minrfold-slot apparatus (Schleicher and Schuell), as previously described 
(Jalkanen etal., J. Cell Biol. 705:3087-3096 (1987)). Nonspecific binding was 
blocked by incubating the membrane for one hour at room temperature in PBS 
containing 10% FCS. Syndecan attached to the membrane was detected with 
a monoclonal antibody against mouse syndecan core protein (mAB 281-2) 
(Jalkanen et ah, J. Cell Biol. f0f:976-984 (1985)) that was radioiodinated by 
chloramine-T oxidation method (Stahli etal., Meth. Enzymol. 92:242-253 
(1983)). The membrane was incubated overnight at 4*C with 125|-labeled 281- 
2 in PBS + 10% FCS (10,000 CPM/ml). After five washes with PBS the bound 
antibody was visualized by autoradiography. 

The accumulation of syndecan ectodomain into the medium was 
estimated by taking samples corresponding to 1/50 (3T3 cells) or 1/100 
(NMuMG cells) of the total volume of the remaining medium at selected time 
points. The samples were analyzed by loading them to nylon membrane as 
described above. The autoradiography signal was quantitated with a GelScan 
XL ultroscan densitometer (LKB) using GelScan XL 2400 software (LKB). 

For syndecan purification, cells were radiolabeled for 24 hours in low 
sulfate DMEM (MgCfe substituted for MgS04; 2% CMS-FCS) with 100 uCi/ml 
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35 S04 (New England Nuclear) in the presence or absence of growth factor(s). 
Cell surface trypsin-releasable material was collected, as described above, and 
after dialysis against Tris-puffered saline (TBS), the sample was loaded onto a 
281-2-Sepharose CL-4B immunoaffinlty column (Jalkanen etal., J. Cell Biol. 
1 053087-3096 (1987)). Bound material was eluted with 50 mM triethylamine 
(TEA) (pH 1 1 .5) and the amount of radioactive PG in each fraction was 
analyzed using cetylpyridiumchloride-impregnated Whatman 3MM filter discs 
(Rapraeger etal., J. Biol. Chem. 260:11046-11052 (1985)). For interaction 
experiments, fractions containing most of the labeled PG were pooled and 
diaiyzed against PBS. 

To obtain unlabeled syndecan ectodomain for interaction assays (see 
below) the same procedure was used except that no radioactive sulfate was 
added to the culture medium and the syndecan containing fractions eluted from 
the immunoaffinlty column were detected by immuno-dot assay using mAB 281- 
2. The estimation of the molar concentration of syndecan was based on the use 
of previously determined syndecan concentration by total amino acid analysis 
(Jalkanen etal., J. Cell Biol. 706.953-962 (1988)). 

SDS-PAGE and Western Blot - For western blot experiments cells were 
cultured 24 hours with or without growth factor(s). Syndecan ectodomain 
containing material released from the cell surface by trypsin treatment was 
fractionated on SDS-PAGE gradient (2 -15%) gel (O'Farrel, J. Biol. Chem. 
250:4007-4021 (1975)). After electrophoresis, samples were transferred onto 
Zeta-Probe membrane using electroblotting 2005 Transphor apparatus (LKB). 
The syndecan antigen on the filter was detected with radioiodinated mAB 281-2 
and the filter was washed, as described above for slot blot analysis. 

Northern Blot- RNA was isolated from 3T3 and NMuMG cells using 4 M 
guanidine isothiocyanate and CsCI density centrifugation (Chirgwin etal., 
Biochemistry 75:5294-5299 (1979)). RNA samples were size-separated on 1% 
agarose formaldehyde gel, transferred to GeneScreen Plus™ membrane (New 
England Nuclear) and hybridized with multi-prime (Amersham) labeled partial 
cDNA clone for mouse syndecan (PM-4) (Saunders etal., J. Cell Biol. 
705:1547-1556 (1989)): After hybridization the membrane was washed in 2 x 
SSC and 1.0% SDS at 65*C (high stringency conditions). For rehybridization 
with glyceraldehyde-3-phosphate-dehydrogenase (GAPDH; Fort etal., Nucleic 
Acid Res. 73:1431-1442 (1985)) the bound PM-4 probe was removed as 
recommended by the manufacturer of the filter (NEN). 
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EXAMPLE HI 

induction of svnriacan mR MA repression in the human breast cancer cells 
{MC.F-7\ growth-inhibited with toremifene. 

The steroid-responsive human breast cancer cell line MCF-7 was used to 
study the expression of human syndecan under different growth conditions 
regulated by estrogen and antiestrogen. For the experiment cells were plated 
at a density of 1 .2 x 10 6 cells/100 mm 0 plastic culture dishes and grown as 
monolayer cultures In 10 ml per dish of phenol red-free DMEM medium with 5% 
dextran/charcoal stripped fetal calf serum (DS-FCS), 2 mM L-glutamine and 3 
ug/ml Insulin. For hormone-treatment 1 nM 176-Oestradiol (E2), alone or with 
1-6.25 uM toremifene, dissolved 70% in ethanol, were added to the culture 
medium on the day following plating. The cells were cultured for 6 days, and 
the media were changed every second day. For RNA extraction the cells were 
washed In situ with PBS and scraped from the plates in 4 M guanidine 
isothiocyanate. 

EXAMPUE IV 

Treatment of Sterold-Resoonsive Tumors in Patients. 

Patients diagnosed as having a steroid-responsive tumor selected from a 
breast tumor, an endometrium tumor, a prostate gland tumor or a mesenchymal 
tissue tumor are administered a composition that contains efficacious amounts 
of the anti-steroid agent toremifene or tamoxifen, or efficacious amounts of the 
growth factor bFGF, TGF-B or bFGF together with TGF-8, In amounts ranging 
from 20-600 mg per day, depending upon the extent of the tumor, the patient's 
age, the patient's sex, and other treatments such as are taken into 
consideration in designing such chemotherapeutic protocols. The syndecan- 
inducing agent is administered for a period of time sufficient to induce syndecan 
in the tumor cells, such that the tumor cells now take upon a more differentiated 
phenotype and such that the growth of the tumor is arrested or significantly 
slowed by the treatment. 

EXAMPLE V 

Stimulation o f Hair Growth in Epidermal Skin Cells. 



Patients diagnosed as being in need of increased hair growth in the 
scalp region are administered a composition that contains efficacious amounts 



WO 94/12162 




PCT/FI93/00514 



of the antl-steroid agent toremlfene or tamoxifen, or efficacious amounts of the 
growth factor bFGF, TGF-8 or bFGF together with TGF-B, In amounts ranging 
from 20-600 mg per day, depending upon the extent of the needed hair growth, 
the patient* age, the patient's sex, and other treatments such as are taken into 
consideration in designing such protocols. The syndecan-inducing agent is 
administered for a period of time sufficient to induce syndecan in the epidermal 
cells, such that hair growth is significantly increased by the treatment. 

FYAMPLEVI 

Dftrmination of Mouse Syrian Promoter and Enhancer Activities 

The mouse syndecan gene has been cloned and characterized up to -10 
kbs upstream from the transcription start site. To determine the specific 
activities of different proximal promoter regions (up to -2 kbs from the start site) 
and enhancer regions (from -2 to -10 kbs) we have made plasmid constructs 
where these regions were cloned into pCAT basic or pCAT promoter vectors, 
containing the CAT reporter gene. The reporter CAT gene produces 
chloramphenicol acetyltransferase enzyme, which transfers the n-butyryl moiety 
of n-butyryl CoA to chloramphenicol. The n-butyryl chloramphenicol can be 
separated from native chloramphenicol by xylene. 

For the further characterization of the proximal syndecan promoter a 
series of retriction enzyme treatments was made on the upstream region (Hind 
ill, Hind II, Bgl II, Stu I, Dra I, Cla I, BamHI and Pst I - Xho I) and the obtained 
fragments, were cloned into the pCAT basic vector. For enhancer areas, three 
Xba I fragments were cloned into a pCAT promoter vector, where the SV 40 
promoter was displaced by the Bgl II • Xho I fragment from the syndecan 
promoter. 

The plasmid constructs were transiently transfected into eukaryotic cells 
by calcium phosphate precipitation simultaneously with a B-Galactosidase 
expressing vector to determine the transfection efficiency. After a four hour 
incubation cells were treated with 15% glycerol and grown for approximately 48 
h in cell culture medium. Cells were then scraped from dishes in TEN-buffer 
and the cytoplasmic extract was obtained by repeated freezing and thawing. 6- 
Galactosidase acitivity was obtained in hte cytoplasmic extreact by adding 0.1 
M sodium phosphate, 45 mM mercaptoethanol and 0.2 mg ONPG. This was 
incubated from 2 hours to overnight and the colour reaction was measured 
spectrofotometrically at 420 nm. 
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The CAT activity was determined by adding 0.25 M Tris buffer, 25 ng n- 
butyryl CoA and 0.0626 fiCi of nc-chloramphenicol to the cytoplasmic extract. 
These were Incubated overnight, backextracted wiht xylene and radioactivity 
was measured by scintillation counting. The CAT activity was corrected for 
transfection efficiency by B-galactosidase activity. 

The cells used for proximal promoter constructs were 3T3 NIH, S1 15 
(either hormone-treated of not) and nMuMG cells. For enhancer constructs we 
used 3T3 NIH cells grown in 2% CMS medium and to test the effect of growth 
factors in 2% CMS medium with 10 ng/ml FGF-2 and 2ng/ml TGF6-1. 



All references cited herein are fully incorporated herein by reference. 
Having now fully described the invention, it will be understood by those with 
skill in the art that the scope may be performed within a wide and equivalent 
range of conditions, parameters and the like, without affecting the spirit or scope 
of the invention or any embodiment thereof. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 
(i) APPLICANT: 



NAME : Alanen-Kurki , Leena 

STREET: Piispankatu 6 E 31 

CITY: TURKU 

COUNTRY: FINLAND 

POSTAL CODE (ZIP): FIN- 20500 

NAME: Auvinen, Petri 

STREET: Kirschgartenstrasse 5 

CITY: HEIDELBERG 

COUNTRY: GERMANY 

POSTAL CODE (ZIP): D-69126 

NAME: Jaakkola, Panu 

STREET: Kellonsoittajankatu 13 B 20 

CITY: TURKU 

COUNTRY: FINLAND 

POSTAL CODE (ZIP): FIN- 20500 

NAME: Jalkanen, Markku 

STREET : Rauvolantie 

CITY: PIISPANRISTI 

COUNTRY: FINLAND 

POSTAL CODE (ZIP): FIN-20760 

NAME: Leppa, Sirpa 

STREET: Valkintie 14 as 6 

CITY: LITTOINEN 

COUNTRY: FINLAND 

POSTAL CODE (ZIP): FIN-20660 

NAME: Mali, Markku 

STREET: Inkereentie 176 

CITY: SALO 

COUNTRY: FINLAND 

POSTAL CODE (ZIP): FIN-24280 

NAME: Vihinen, Tapani 

STREET: Kaskenkatu 11 C 54 

CITY: TURKU 

COUNTRY: FINLAND 

POSTAL CODE (ZIP): FIN- 20700 

NAME: Warri, Anni 

STREET: Juhaninkuja 2 as 2 

CITY: LIETO 

COUNTRY: FINLAND 

POSTAL CODE (ZIP): FIN-21420 



(ii) TITLE OF INVENTION: SYNDECAN STIMULATION OF CELLULAR 
DIFFERENTIATION 



(iii) NUMBER OF SEQUENCES: 3 
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(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS -DOS 

(D) SOFTWARE: Patent In Release #1.0; Version #1.25 (EPO) 

(v) CURRENT APPLICATION DATA: 

APPLICATION NUMBER: WO TO BE ASSIGNED 
(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/988427 

(B) FILING DATE: 01-DEC-1992 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26700 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: join (437 8. .4443, 22026. .22106, 23001. .23483, 

23905. .24039, 24251 .. 24418 ) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 



TCTAGATATT 


CAAACTCACC 


AGATGGAGTG 


ATGTCCACCC CTATTGGTGG GAGTGACTAG 


60 


TCTTTCCTCT 


GTCTTCTGAC 


TCAGATGCTT 


AGCTAGCTCT 


TTAGGACCCA 


CCCTCACACC 


120 


TGCAAATAAT 


ACTTTATTTG 


CTCTCTTAGT 


ACCTTTAACC 


CAGTGGAGTT 


GACATGAGAA 


180 


ATTAACTACC 


ATAATTTATA 


ATATTTCATT 


TCATAAATGA 


AAAGTAAAAT 


AAATTAAAAA 


240 


ATAGAAAGGT 


TTGAGCATGA 


TGGCCCAGTG 


GTAAAGGCCA 


GTGGCTCCAA 


CGCAAGTCCT 


300 


GACAAATGGT 


AACGGGCCTG 


TTCTTCAGGC 


TTGAGGGAAG 


TTTATTGATT 


GAGGCTAAAA 


360 


GCAACCCAAA 


GGCTCCACTT 


GCCTAGTGTG 


A AGCCCTGGA 


TGTGCTCTCC 


CACACTGCAT 


420 


GTCCACCTGT 


GGTGTCAGCA 


CCTGGGAAGC 


TGAGGATGAT 


GGGGAGTCCA 


AGGTCATTAG 


480 


CTACATAGTA 


TAGGCTAGCT 


GGGGTACATG 


GGTCACAAAA 


AAGAAAAAAA 


AATAAGCACA 


540 


TTGTAATCCC 


AGCACTTGAC 


AGACCAATGG 


GGGGGGGATT 


GCTGTGAGTT 


TAAGACAGCC 


600 
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TGGCCTACAA 


AGAAAAACCC 


TACCCAAACC 


CAAGAAAAAT 


GAAACCAGTA 


ATATAAATAG 


660 


CTATTTTCAT 


TTTAAATGCT 


CTAAAGACAC 


AGCGTTAACA 


CAAAAGCTCT 


CGTCTGTGGT 


720 


TCCTATTCCC 


TCCTTCTCCC 


CCAGGTCTTC 


TTTAATGTAT 


ACTTTTTGTT 


TGCTTATTTG 


780 


CTTGTTTTGG 


ATTTTGGCTT 


TTAAAGACAG 


GGTCTCACTA 


TGTAGCTCCA 


ACTATTTGGG 


840 


AACTCACTAT 


GTAGACCAGG 


CTAGCCAGGG 


ACTTATAGAG 


ATCTACCTAC 


CACTGCCTCC 


900 


CAAGTGCTGA 


GACTAAAGGC 


ATGTGACACT 


TT6CTTGGTT 


ATTACAAACA 


TTTTAAAAGA 


960 


ACATTTTGAA 


CATTAATAGA 


TGTATGTATA 


TATATCACTC 


TATGTAGTAT 


ATATGTTAGA 


1020 


CATTTTTCAC 


TTGAGATACA 


TATTTACTCT 


CAAAATAAGT 


TTTTTGTTTT 


TTTTTCTTCT 


1080 


TTTTAAATTT 


ATTTTATTTT 


TTTTTTATTT 


ATTTTATTAT 


TATATGTAAG 


TACACTGTAG 


1140 


CTGTCTTCAG 


ACANACCAGA 


AGAGGGAGTC 


AGATCTTGTT 


ACGGATGGTT 


GTGAGCACCA 


1200 


TGTGGTTGCT 


GGGATTCGAA 


CTCTGGACCT 


TCCGAAGAGC 


AGTCGGGTGC 


TCTTACCCAC 


1260 


TGAGCCATCT 


CACCAGCCCC 


TTAAATTTAT 


TTTTATCTTA 


TGTCCATTGG 


TGTTTTGCCT 


1320 


GCATGTATGT 


GTAAAAGTGT 


CAGAAACTGA 


AGTTACAGAC 


TGTTGTGAGC 


TACCATTGTT 


1380 


. GTGGGTGCTG 


GGACTTGAAC 


CTGGGTCCTC 


TGGAAGAGCA 


GTCATTATTC 


TTAACCACTG 


1440 


AGCCATCTCT 


CTAGCCCTCG 


TTTTTTAGTT 


TTTTTTTTTG 


TTTTGTTTTG 


TTTTTTGTTT 


1500 


TTTTAAGATT 


TTCTTATTTA 


TTATATGTAA 


GTACACTGTA 


GCTGTCTTCA 


GACACTCCAG 


1560 


AAGAGGGCGC 


CAGATCTCGT 


TATGGATGGT 


TGTGAGCACC 


ATGTGGTTGC 


TGGGAATTGA 


1620 


ACTCCAGACC 


TTTGGAAGAG 


CAGTCAGTGC 


TCTTAACTGC 


TGAGCCATCT 


CTCCAGCCCC 


1680 


GTTTTTTAGG 


TTTTTGAAGA 


CAGGGTTTCC 


TGTGTAGCTC 


TAGCTGTCCA 


GGAACTAGCT 


1740 


CTGTAGACCA 


GGTTGGCCTC 


AAATTTAGAG 


ATTTGCCTGT 


CTCTCTGCCT 


CTCGAGAGCT 


1800 


GGGATTAAAA 


GTGTGCAGCC 


CAACAATCTA 


CTCAAAGTAG 


GTTTTGAAAA 


AGCTTTCCAT 


1860 


ATTAGGAGTT 


AACTAGCTTC 


ATTTCAGAAA 


TACTGCATGG 


AATTCAAATG 


TGGGACCATT 


1920 


CATAGCTACT 


TTGGTTTTCC 


TTCAGTGACA 


GGCATTCGGC 


ATGCCTATTA 


GGGAAGTCAA 


1980 


ATGGCCTGGA 


GAAGTCATCC 


TGGGTGAGAG 


GGCTAATGCA 


TTTTCAGCTT 


GACAGACACT 


2040 


GTCAACCTAT 


GCAGACAGTC 


TGCTCCAGCT 


CAGATGTCAA 


TTGCATGCAG 


ACCTGCAGTC 


2100 


AGACGCTAAG 


CTCCCTACCT 


ACTCTCCATC 


AGCTTAGATG 


TAAGGGGTGC 


TGGAACAAAG 


2160 


GCTCTCTCTC 


TCTCTCTCTC 


TCTCTCTCTC 


TCTCTCTCTT 


TCTTAGAATT 


AGTATTCTAT 


2220 


TTTATTTTAT 


GTAAATTGGT 


ACTTCACTTA 


CATGTATGTC 


CGTGTGAGGA 


TGTTGTATCC 


2280 


TCTGGTACTG 


GAGTTATAGA 


CAGCTGTAAG 


TCGCCATACA 


GGTGCTGGGA" 


ATTGAACCCT 


2340 
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GATCCTCTGG 


AAGAATAGTC 


AGTGCTCTTA 


ACCCCTGAGC 


CATCTCTCCA 


ACCTCTTGCA 


24UU 


TATTGAGGAC 


AGGGAGGAAT 


CACAAbuLA I 


V.7 1 A ww*o X btL 






2460 


ACCATAGCCT 


CCTTTCTTTA 


Tv?x owwl 11U 




PPTATAftf^Afi 


TPRTPTTPOT 

X WW X w X A WW X 


2520 


TGCCTCTTTA 


Affirm Am/i % Mtiti 

CTGTCTCATT 


GATCTGGGCT 


AAACTTATGC 


AGTTGGAAGG 


A A AP AHPPA AP 
AAA wA. 1 LAAb 




CTGGTCATGT 


TTAAAACATG 


AAACAGCCTC 


ATCAGTTCCC 


TTCCTGTTCC 


CGTCTCCCCC 


264p 


CCCCCTCCCG 


CCCCCATTTT 


GAGAGGACAG 


GAAGGTAAAA 


TACCAAAGTG 


TCCTATTTTC 


2700 


CTCCAAATAT 


CAGGCTCAAA 


GGACTGAAGA 


GCTGACTTCA 


GATCCCAAAG 


CCACTGTGTT 


2760 


AGGAGGCACC 


TGCTTTTTAG 


GTCCTAAGCC 


TTCCTGAGCC 


TtGCTATTGG 


GTATTCTTTA 


28 2 U 


CCAAGACCCT 


CAAGGATCTA 


GGCAAGAACT 


GGGCAGGATC 


TGTATGTAGC 


CCATAGTTAG 


/ooU 


ACCTAGG6CA 


GCTGAGACGC 


CAAAAGGGAG 


AGTTTCCTGA 


GGACAAAAGT 


GTTCAAACAC 


AA 1A 

2940 


AACTGGGTGC 


TGGTTGTTGG 


GCTACTCGTG 


GAGGTGTGGT 


GTGTGTAAAG 


GAGGCTGTTG 


3000 


AATTCCCAGA 


AGGCTGGTTC 


CACAGTGTAG 


AGTCTACACT 


GGGGACTTCC 


CGAGACGCTG 


3060 


AGCCTCAGAT 


CTAGCTTCTC 


AGTCCAGGCC 


AGCTGATGTG 


GGGCTGAGGA 


ACAAGGATGG 


3120 


ATGCCATCTA 


TGGCCCTGCC 


TTGCAGGTGC 


AAAGGGCCTT 


TGGCACCATC 


TACAGATTGA 


3180 


GGGCAAGACA 


GGGCTGGTTC 


TTCCTCCTTG 


CTCTCGCTGC 


TATCTGCCTC 


GCCTGTAGGC 


3240 


TCTCTGGGCT 


CCTTTTTGGA 


CTGACACGTC 


TGAAGGAGCT 


TGGAAACTGT 


GAGGTCCAGG 


3300 


CCCCATAGAG 


AATCATGAAG 


GAACAGGAAT 


TCAACTGGAG 


CTCCGCAGCT 


GGTTAGGCCT 


3360 


GCGGTCACCT 


GGAAACAAAG 


AGGCCATTTA 


TTTTTTCCTT 


TGGTCTTGGA 


CAAGGAAGAG 


3420 


AAGGGGCTTT 


CTATAAATAG 


AAAGACAGCA 


AAAAAGAAAA 


TAATAATAAT 


AATAATAATA 


3480 


ATAATAATAA 


TAATAAAAAC 


AATAACAAAG 


CCAGCTCTTC 


CAGACAGTGC 


TCATGTCTTT 


3540 


AAAGGTCTTT 


AAAGGTCTGG 


AGTTCCCAGC 


AATTAAGTAA 


AGGACCAAGA 


CCTCAGGGGT 


3600 


CCCCTATCCT 


CAGCCCGTGG 


GGAGGTGGGA 


ACCATACATC 


GATCCCTCGG 


TTTATATATA 


3660 


GCCTCATCGC 


TGTGGGGCTC 


CGAGGTTGCC 


CCCAAAATCT 


TGCTCACCTG 


GAGGACCCCT 


3720 


GGGTGTCCTC 


GCCCAGAGGG 


CGCTGCAGCC 


TCGCACGTAG 


AGAACTAACA 


TCGCCCTTCT 


3780 


CCAGGGCAGT 


GCCTCCGGAC 


TCCGGACCAG 


GACATAGTAG 


CGAGTGCACC 


TGGGTCTCCG 


3840 


TCAGCTACGC 


ATCAAGGAAG 


GTGCGACGCG 


GGAATTACAG 


ATTGCCGGCA 


CTCACCAGTG 


3900 


CTCAGGGGAG 


GAAGGTGGGA 


CTCAGACCTG 


CAAGAGCTGA 


AGAGTGGGGT 


GGGCTTCGAT 


3960 


CCTAGGAGGC 


GTGGAAGGGG 


GTGTGGCTGG 


ATCCCTGGGG 


GGTGGGGCGC 


TCCAAGGGGC 


4020 


GGGGCAACCC 


AGGGGGCGGG 


GCCCGAGGGG 


TGGAGATTGG 


GACTACCCAG 


GCCCGCGGAG 


4080 
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CTGGGGGTGG GCGGCTAGTT TTGCAACTGC AGAGCCTTTG GGTTTATTAT AAGGCGGAGC 4140 

TCCGCGGGAG AGGTGCGGGC CAGAGGAGAC AGAGCCTAAC GCAGAGGAAG GGACCTGGCA 4200 

GTCGGGAGCT GACTCCAGCC GGCGAAACCT ACAGCCCTCG CTCGAGAGAG CAGCGAGCTG 4260 

GGCAGGAGCC TGGGACAGCA AAGCGCAGAG CAATCAGCAG AGCCGGCCCG GAGCTCCGTG 4320 

CAACCGGCAA CTCGGATCCA CGAAGCCCAC CGAGCTCCCG CCGCCGGTCT GGGCAGC 4377 

ATG AGA CGC GCG GCG CTC TGG CTC TGG CTC TGC GCG CTG GCG CTG CGC 4425 
Met Arg Arg Ala Ala Leu Trp Leu Trp Leu Cys Ala Leu Ala Leu Arg 
1 ~ ~ 5 10 15 

CTG CAG CCT GCC CTC CCG GTGAGTGTGG CCCGGGGCAG GGCTGGGAGG 4473 
Leu Gin Pro Ala Leu Pro 
20 



CGGCGGGAAG 


CCGGGACTCG 


CCACTCGCCG 


ATGCCATGCA 


GGCGGCAGCA 


CGTGGAGGGG 


4533 


GAGGGGAGCG 


GGGACTTCTT 


CCCGCGCTGC 


CTGGCGGATC 


CTGGGATGGT 


GAGCCCT(TTA 


4593 


ATGAGGACTC 


CTGTCCCAAT 


TCCTCTACGG 


TCCGTGGATG 


CCAGGAGGCT 


ATCCCAGCTC 


4653 


GTGGTCCGGG 


CGTCCTGCAG 


AGTGGAACCT 


CCATTGGTTC 


CCCGCTCCCA 


ATTAAGTAAA 


4713 


ACGACTCCAC 


AGGGGTCTGA 


GTCGCCGGCC 


TTAGGCGCTC 


CGCCGGCCTT 


AGGCGCCGCT 


4773 


TGGAGTTGCT 


CTCTCCCGTT 


GCTGTCTTGC 


TGGCCATCTC 


AGCGGCCTGG 


CCTCCGCCAG 


4833 


TGTCCCGGAG 


GATGCAGTGG 


CCATGGCCAA 


ACGCCTTTTC 


CATAGACCCT 


AATTCAAACC 


4893 


AGACTGCAGG 


CTGCACCCCC 


AGCGCCGCGG 


AGTCCGGGCG 


CTCGGCCCTT 


TGCACCGGGG 


4953 


CAAGTTTGGG 


CACAGCAGAG 


CCGGCGCGGG 


AACAGGGGGA 


AGCTGACGTT 


CGGGGTGGCG 


5013 


GGAGGGACGG 


GATTAAGGCT 


GTTTGTGGGA 


CACAAGAGGG 


TGGCTCAGGG 


ACTTCGGTTT 


5073 


TTCTCTGGCT 


GCCCCAGGTG 


AGCCGGGCCG 


AGCTGGCAGC 


GGGAGGTTCC 


GGGAAGTTGG 


5133 


CTTCAGAACG 


CTGAAGACCC 


TAAGAACCCA 


ACTTTGGGGT 


CGCTGAAGTT 


GTGCTGCCCC 


5193 


CGGAGGGCCT 


CCTCCGCATG 


GCCCGCGCGG 


GGGACCCTCC 


CCGCGAGTGG 


ACCCCGGTAC 


5253 


GGCTCTTCCC 


CTCCCCCGAC 


TCGGCTTTGT 


GCTGAAGCCG 


CGCGTAGGGA 


AGGCGGGTCC 


5313 


CTTGGCCCGC 


CCAGTAGGGC 


GGCGGGGAAA 


GAGGGACGAA 


CGTGGAGCTG 


GCGACTGGTG 


5373 


GGGGAAGCTT 


CTGGGTAGGA 


TGCAGCCATC 


CACCTTTGGT 


GGGGTCGGTC 


TCTCTAATCA 


5433 


GCGGCTTGGC 


GACAAAGAGC 


TTGGTCGAGG 


GTACCCCAGA 


AAGTGCTCTC 


CCGCCCCAAG 


5493 


CCGCCGTCGC 


TAGCCCGCCT 


TCCCAACGGG 


CGCTTTGTTC 


TCGGCCCCTG 


TAACCCTTCC 


5553 


CTGGGAACCG 


CCCCGCAGCG 


CTGGTCCTTG 


ACGTGGGCCG 


GGTCCTGGGT 


CGCCGCCAGT 


5613 


GTCAGCGCTG 


CCCTCCGGTG 


TCCACGCCCC 


TAGCCCCCGC 


ACCCGCTGTG 


AAGTCCCGGG 


5673 
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TGTCCTTTCC 


ACTGGCGCTT 


TGCCCAACCC 


CTGGAAGGCA 


GAGGCGAGGT 


GCGGAGCCTC 


5733 


AGGCTTTATC 


CTCCCGGAAG 


TGGCAGTCTC 


CCACCGCCAC 


ATCTGGTCTG 


CTTAACTTCG 


5793 


ATAGTCCTGG 


CAAAGGCAGA 


CACGTGCACA GGGAAGGAGA GTTGAGCGCT 


GGTAGATACC 


COCO 

5853 


AAGGTCGTGT 


ACAAATAAAG 


TGGCACACGA CACGTCCCCA GTCACTGTTA ATGCATTGCC 


CQ1Q 


TTCGCTCCTT 


CCCAGGTGGC 


TGGTGCTCTC 


wA iLAL 1 w 1 


n a cppp A A G A 


GAGGGCCTCC 

wnwvVJvv «*■ ww 


5973 


ATAATTGTAT 


TGCCCATGAG 


TTGGGGTTGT 


GTGGGGGCGC 


p & & a tp a dflfS 


T TP TP TGGG A 


6033 


GGGCTATGAA 


TTCCGAACTG 


AGTCTCCTGT 


GCACTCCTGG 


/•^TTTIV Aflf2TT 

Oil 1 >V/\L7^JT 1 X 


PAAGAAATTG 


6093 


TTTGAGGGTT 


GTGGTTTTTG 


T GGG ACTC AG 


7v mm is rrpr'p'Pr 
ATTATlj LA* I o- 


P 21 & TP A T ACT 


TAPPACTGTG 


6153 


GAGAAGAAAG 


TGGAGCTACT 


TAGCATGCCT 


CCCCGGCCCG 


CCTGGCATTA 


CCTCCGGCTC 


6213 


TGTTCTCTAG 


GCCCAACGTG 


AGGCCTCACT 


GGGGCAGTAC 


AGATGCAGTA 


CTGAATTTCT 


6273 


TTCCAGCCAG 


GATCTGGAGA 


GGTGGTGTTG 


TCTTCCCTGG 


TGTCTTTAGA 


GAGGCAGATA 


6333 


TTCCTGTGAC 


CTAAGCCCCT 


CAAGCACCCA 


TTAATAATGC 


TGAGTAGACA 


ACTAGAGGTG 


6393 


GCGTTTTCCG 


GAACTTCCTG 


TGTGCTGGCC 


TGGGAGGTTG 


AACCCTCTAG 


GAAACAGGTC 


6453 


TAGGAAGTAG AATTATCTCA 


ATGGAAGGCT 


TCCTGGAGGA 


AGAAGATGAG 


CTGAGCCCCC 


6513 


AGGTCACTGT 


CTGAGCTTTA 


GGATCAGACT 


CCCACTTGGA 


GGCAAGAGTG 


TTCGTTTTAC 


6573 


TTTTTTTTTT 


TAAGTTTAGT 


TTATTTTCTC TCTAACAGAA AACAAACAAA CAAACAAAAA 


6633 


AAAACCCCAC 


ATTGTTTAAA 


AGTGGGTGCA 


TAAGAGTGAG 


GACATATTCA 


GAGCTTCCCC 


6693 


TTTTCCTGAA 


AAATGAAGGC 


AGCTGGGATT 


TACTTAAAAT 


GAGAGCACAT 


ATCACAATTG 


6753 


CCAGAGAGCT 


GGTCCCTTTC 


TCAGGGCTCC 


CTAAGCTCCT 


GTGGGAAGCA 


GGTCAGACAG 


6813 


CCCTGGGGAC 


CAGAGAGATA 


GGGAGTGCTT 


TTGGGTGCCT 


GCCTTTGAAT 


GGGGAAGGGG 


6873 


GGGGGAGCTG 


CTGGGATCAG 


AGGCTGCTAG 


CAACTACTCC 


CCAGAGACTG 


AAGCAGGTTT 


6933 


GTCCCTCAGT 


GTCCTGTGGT 


CTTCTGTTTC 


TCCTATATAG 


AATAGGAGAA 


ATGGTTATTT 


6993 


GCTCTGGAAT 


AGTGACTTGC 


TATTTGTTCC CTTTCTTTCC 


TCTCCCTTAC 


TGTAATCATT 


7053 


TGGACTAGTA 


GAGACACTTT 


CCCCAGGTCT 


GGCAGAATGG 


GAGGGAGTGG 


GGGAGGCCTG 


7113 


TGCTTGCATG 


ATGTCACTGC 


TGGCTTCAGC 


TCTCCAGGGA 


GGGTGGAGTT 


GGTTGTAACC 


7173 


TACCTGTGGC 


TCTTGATGGG 


CCACAATAAA 


ACCTCATTAA 


CACACATTGG 


TAGGGAGAAG 


7233 


GGACTGGAAA 


GAATGATGGG 


AAAGATTGAT 


GTTTTTCCTT 


TTTTTTT^TTT 


TTTTTTTTTG 


7293 


GCAGTACTTT 


CTAGATCTCC 


CCTCCCCCTT 


GCTGCAGCAA 


AATTTTGGAT 


TCCTGAAGTC 


7353 


CTTTGAGAAT 


GTATAATGGT 


AGCCAGACTT 


TTTTTTTTTC 


AGTCAGCTCA AAATTGCCTC 


7413 
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CTTATAAAGT 


ATCCTTGGTT 


GTTTTTTGTT 


GTTGTTGTTG 


TTGTTGTTTT 


GTTTTGTTTT 


7473 


AAGACAAGGT 


TTCTCTGTAT 


AGTCCTGGCT 


GCCCTGGAAC 


TCAATATGTA 


GACCAGGCTG 


7533 


GCCTCAAACT 


CAAAGAAATC 


CACCTACTTC 


TAACTTTCAG 


TGCTGGGCCT 


AAAGGTGTAG 


7593 


GCCACCAAAA 


GTGCTCAACT 


TTTACAAAGC 


AGTCTTACTT 


TGAGCAGGAT 


TCTGAAACCC 


7653 


TTATTTCCTT 


TCTGTTATCT 


TCAACAATAC 


ACTGCTAGGT 


GTATTTAGTC 


CCTCATGATG 


7713 


CTGGGCCTCC 


TCAAGTGGCG 


CCAGGTCAAG 


CAGTCTCCTG 


GTTTTTGGTG 


GCTCTGAAGA 


7773 


AGACTGTGTC 


CCAGTGACTG 


GCAGTTTGAA 


TTCGGAGCTT 


CTCTTTTCCT 


TCTCAGTCTT 


7833 


TGGCAGGCAG 


AGTGACACTG 


GTGTGCCCAA 


GCCTGGAGCT 


TCTCTGTTTA 


ATTCTAGTTT 


7893 


ATTTTCTTTA 


TCAGACTGAA 


AAACAAATCA 


GGTTGGTTAT 


AATTCTTATA 


AACACGAAGG 


7953 


TCTCACCTTT 


GCGTACGTCT 


CCGGCTGTGT 


GGGTCTGATG 


TCCCTCGGGA 


ATCTCTGTTG 


8013 


AGGCTGCTGC 


AGTGTGTGTG 


CGTGTAGAAA 


GGGCAAGGTA 


GAATGGACAG 


AAGCGTGCTG 


8073 


CCCACCCCAC 


TGTCCTGTTC 


CTAAATGATG 


AAGCACTGGC 


CCGGTGAAGA 


GCCTAGAGAA 


8133 


CTCCCTCGGT 


GGGAGATGCA 


CACAATGCCA 


GGAAGCACAC 


AGGAGCTTGA 


GTTCCAGCTT 


8193 


GGCAGTGTCT 


TCTCTTTGGT 


GACTTTATCA 


GCTCCAGCTG 


CCCTGGACTA 


ACAAACAAGG 


8253 


CTAGCTCACT 


CTCAGTATTG 


ATAATCGAAG 


GTCCTTGGTT 


CTGTTTGAGA 


CTGATCCTCA 


8313 


CTCGGTAGCC 


TTGAACTCTT 


AGCAATTCTC 


CTGTCTCAAC 


TTTCAAAGAG 


CTGAAATTAC 


8373 


AGACTCGAGC 


CACCATATGC 


GACTGAAACC 


TTGTTCCTAA 


TCCTTGACTG 


TGAACGACTC 


8433 


TTGGGTTTGG 


TTCTTTCTCC 


ATTTCTTTAG 


TGTATGTTTT 


AGTTCGCGTC 


CTACATAATC 


8493 


TATTGCCCAT 


ACTTAGAAAC 


AACAGGTTAG 


AGACAGCATT 


GGGTCCAGCA 


GAGCCTCACA 


8553 


CTGAAGCTCA 


GTCCTGCCAC 


TGATTTACCG 


TGTCAGCTCA 


AGTGACTCAC 


TTCCAACTCC 


8613 


TCTGCTCCCC 


ATCTGTAGAG 


TAGACATCAC 


CATACCTGCT 


CTTTCTGCCC 


ACATTCTGTC 


8673 


ATTAACATGT 


TCATTTCATA 


ACGATGGTGC 


AAAAGTGCTT 


TGTAAGTAAA 


GTGCTGGGGA 


8733 


AATGTTAGCT 


GTCGATAATG 


GTTAGGGTTA 


ACTTTTTATT 


GAGTGCCTGT 


TGTGTGTGGG 


8793 


GTTGGGTGGG 


GTTTTTTTAG 


AGGCTTGGTA 


GTTTTCTTAC 


TTCTTTCCTA 


CTTAGCTTTT 


8853 


CTTCCTAAGC 


CTTTATGGTA 


TGTATCATTG 


CCTGATTGTT 


TGAGTGTGTG 


CACTGAGGCA 


8913 


CGCCTGTGCA 


TGTTTGAGAG 


TATGCTTGTG 


CGTGCTCTCG 


TGCTCACATA 


TGTATGGTGT 


8973 


GAATACACTG 


TAGAGTGCAG 


GCCGGCACAC 


TGGGGCTGGC 


TGAATCCTGT 


GAGCCCTGCC 


9033 


TGGAGTTTGC 


AGATCTTCCT 


TGGACACTCC 


TGCTTGTGAG 


CATTTTGTGT 


GGAGTGACTG 


9093 


TTTAGCTGGC 


TGTAGCCTAC 


ATTGTGCCTT 


TGGGTAAACC 


CTGAGTATTG 


GGAAACACCC 


9153 
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TGGGCTGTGG 


CTGTGTGTGC 


CCGACGGTTG 


CTTGGGTACA 


GCTAAGAACT 




QUI 


AGTTGAGCTC 


ACATGCTATT 


AGTATTAACT 






TGGGTGGTAC 

X V7V7W A V?^J X «TO W 


9273 


CTGCTTGCCC 


TCTCATGCAG 


TTTATCTTGA 




LnUnLX 




9333 

7 www 


AGCTTTTGTC AGCCCTGGGA GGTGGGTTTC 


GTGGCCACAA 


GTGGGTAGCT 






GACTCCTGGC 


TTCTAGGTTG 


CATTCTCCTG 


TGGTTCTTTC 


CAAGGGAATG 


CTAGGGGAAC 


Q /CO 


ATTTTGGACA 


TTAGATTATT 


TCTAGTCCCA 


AAGCACACAG 


AACATACTGT 


TTCCTAAT1 




CCTTTTTTTT 


GTTTTCCTCT 


CAATCTGGTT 


TTGAAGTGTT 


GGC?ri ibAAA 


Ail bLUUUU 1 


V*J / <m? 


GAGAGCCTGC 


CCTAGTGTGT 


GCAGAGGGAA 


GATAGTGGAA 


CAGGAAGTCT 


1 AuAAAb 1 A 




TCTTCCTTTC 


CAGGACCTTG 


TGCCCCGGAG 


Mm — . MM ^t* A Mm A 

CAGAGTCAGC 


ATGGTGTCAT 


ATCGCTTTTG 




GCTATTCCAG 


AAGAGATGAG 


GTTTTAGGTG 


AGAATGAACC 


TTTTAGAACC 


TTCTAGAACC 


9753 


TTCTGTTGAG 


TATGACAGGA 


ATGCCCTGAA 


TAGGGTCCGA 


AGTGCATGGC 


CACTTGTTTG 


9813 


TCTTTTCCAT 


AAGCAAGCAG 


CTTCAGGTAC 


AGACAATAAG 


ACTAGGTTCT 


TGGAGTGAGA 


9873 


CCCTGCACTT 


GGTGCCATTT 


CAGCTCCAGA 


TGGACACTGG 


AGGTCCCTAC 


ACAGCAGGCT 


Art AO 

9933 


CTGGGATGGC 


TGGCTTTGCT 


ATGTACTGTT 


GCCTGCTCTA 


CAAGAGCTTC 


CCAGGTTACT 


9993 


AGCCTTTGTC 


GACGCTGGGC 


TCGCTGGCCA 


GGCTTGGGCA 


TTGGAGAAGG 


GACAACTTGC 


10053 


CACCTGGCAT 


AGGCTGTGTG 


TTTGGAGAGT 


CAGGAGGTCT 


GGTGAAGCCC 


GCAAGTGGAG 


10113 


GCAAGTTTAG 


TGGGACTTGA 


GGAGAGCTCA 


GTAGGAAATC 


TCTGGGCTAG 


TGACAGGCAG 


10173 


GTGTGGTGGT 


GGTGGCGAGG 


TGGCGGGTCT 


AGATCTCCTT 


MM MM A ^n. A MM MM MM 

TTAGAGATTT 


GCCTAGGGAT 


10233 


CGTCCCTGCT 


GACTCTGGAA 


CTCAGAGGCC 


TCCAGAGGTG 


TCTCCTCTGG 


GAGCCTCTCA 


10293 


AGGGTCTCCC 


ATCTCCTACT 


GTTTATGGCT 


TTGTGGGCTA 


CCTAATTACA 


TAGAGAAGAT 


1 A. A C A 

10353 


ATGTTCCTCT 


GCCTCCAGCC 


CTGGAAAGTT 


CTGCCCAGTG 


ACTCACCTGA 


<MH Mm Mm MM <*M* **4l <ft ^4i^4^4i 

GCCTGCAGCC 


10413 


ATGTGTGTAC 


ACAGGCGCTC 


TCAGGGGCTT 


CTGTCCTGCT 


GGCTTCAGCC 


*H MM MM JMH MM 4k AJ^i^Jtl 

TTTCTAGCCC 


10473 


CTGGTGTTCT 


CGGCAGTGGT 


AGCATCTGGG AAACCGGGTC ACCTCTTATT 


TGCAGCTCCC 


10533 


TCCCTTTCTT 


GGTGTCTTCC 


CCCTTTTTAA 


CTACTGGTCT 


GATGGCCTTA 


MB M Mb* MM Mm M, MhAlMfM 

GACTCATGCT 


10593 


GAAATTCTCC 


TTTCTTTTGT 


CCTAGCCTTG 


TCTCTGACTT 


CTTGTGATCC 


TCTGGGCCTG 


n ^ co 

10653 


TGAAATCCGC 


TCAGGGGCCT 


CCATTTCTAA 


CAGTCACACA 


CTGGTGGAGA 


GACCGAGTCC 


10713 


TGGGATGGTG 


AAGCTAACCC 


TGCTGGGCTT 


CTCAAGCTTC 


ATTTGGTTTC 


TCTTTATTCC 


10773 


TTCTGGAGGT 


ACTGCCTGCC 


CCAGGGGAGT 


CTCAGACTAG 


ACCACTCTGG 


AGTTGGAGGT 


10833 


GGGGCAGGTT TTCAGATCAG TGCCCTTGGC ATTCGTTGTG GGAATGGGGT GGATGGGGCC 


10893 
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fri m /** K* /"^ Tt it 

TCTGGGCAAG 


t**mt** n /*^ r> *** m a"i /-^ 

GTCAGGCTGG 


GGGTGGAGGC 


CAGGTGATGT 


TCTCCGCACC 


x x ***r>r* x p*p» 

CACACCCAGG 


J.UVDO 


» ^* m ***) *** *%. 

CAGCCTGGCA 


CCCTCCCCAA 


GGTCCGCTCA 


in r~* « it it it 

TCAGCAGGAA 


TGAAAGCAGT 


*~> fir* r***" it r+r* 

GCCGGGCAGG 




TTGGGGCAGT 


GGGCAGGTGG 


GCGTGTTTAT 


CGCTGTGCTC 


ATCAGCTGAG 


TCACGATGCC 


11073 


AGGCCCCACA 


AGTCCTCCCT 


GGAGGCTCAC 


CCCACCCACC 


mm^i ********* « ^yt*^ 

TTGACCCACC 


» » ********* % ^« m 

AGCACCCACT 


11133 


AGCAGGAGGT 


AGGGCAGGGC 


AGTGAGACAA 


GACCAGCCTG 


GGGGTCTGAG 


AGGCAAAGGG 


ini pi 0 
11193 


GAGTTGTTCA 


m****\ a> m r* *** r* m 

TGACCTGGCT 


GTGCATGGGG 


ACTTGTGGGT 


GTCTCAGATA 


TCTCTGCTGT 


11253 


p*p» x c*r* x r*r* n x 

CCAGGAGGAA 


GCTGTCTTAA 


GTGCCAACCT 


GCCTAGAGCC 


P»P , rnp»p»rnp^p*riai 

CCTGCTGGGT 


P"/*i x P^o x x x mp 

GCAGGAAATG 


11 Jlo 


OACAAGGGAG 


x p<rpp*p»p^p»x mp* 


I GGAA I AG 


r*r % c*r t x mpp x 
bLLuA 1 GGAG 


PTIRPTlPPRPfTl 

U 1 AGAUCAGT 


PUPHPrnpRPA 

GACAGTGACA 


1 1 Q*7Q 

lxo /o 


GTGAAGTUAG 


r*t************\ ppmp 
CCCCCACCTG 


TGTCTTCCGA 


c*c*r^ x pprnpp x 

GCCAGCTGGA 


GGGTTTTTAT 


rtmpi p»x rnmp^fTi 

CTCAGATTCT 


114oo 


» •» ****** it m 

GCGAAACCAT 


%n« » mpim it p*m 

AGAATCTAGT 


CAGGAGCCTA 


GACTGCAAAG 


CAGGCTTCGT 


mo it mPPmmmit 

TGATGCTTTA 


11493 


% /"* m m n ********* 

ACTTGCAGGC 


TTCCTGGGTA 


TGAGGGATAC 


TTAGAAAGGT 


CCCGCAGGTA 


GGGAGGGCAT 


11553 


*** » ****** % it Aim » 

CAGGAAGTAG 


AAGAGGGCCA 


^1 *m\ m m r^n -m 

GGCACTTCTA 


TCTCCTGCAT 


TGCCCCCTTC 


TCCCATCTCC 


11613 


x it *** it m *** *~> m » 

AAGGATGGTA 


AAAAGAACCC 


TTCCAGTACA 


CTGACAGAGA 


GGAAAACCCT 


TCATCTCACC 


11673 


*^**+ *\ mmm^o it m 

CCATTTGGAT 


CTGTCGTATC 


AGCATGTGCT 


GGCCCTGCTT 


CCATACCAGA 


GGTGGCTAGA 


11733 


GATGTTCCCT 


GGGAATTCAC 


TGGTTGGGGA 


CTTGAGTGTA 


TCAGAGGGGC 


ACAAAGTAAC 


11793 


X mm it it AUnOm^ 

ATTAACTCTG 


GTATCCTCTG 


CAGCAAATCG 


GAGATCCCCT 


CTCCTAGGCG 


AGTTCTCAGT 


11853 


GGATATGGAG 


A^mAnt pnfnnim^ 

GTCAGGTTTG 


GGCTTGTAGG 


GCCCCAGCAA 


GAGTCGTTGA 


TGTCACTCCA 


11913 


GCTTCTCCCG 


X P* X mAi X 

AGGAAGATGA 


GGGTGCTGTG 


m m 0 it m^i it 0 

TTGGGATCAC 


ATCTCTCCCT 


GAATGGCATG 


11973 


TTGGGGAGGG 


It mpp ft nppnm 

ATGGAGCCCT 


fPPPrnrnpriip x 0 

TGCTTCTGAC 


prtrtm iv it r*^min 

CCCTAAGCTT 


GGTCTTTAGG 


m************ x /■"» it r^m 

TGGCCACAGT 


lzOoo 


U rt/TGGGTTC 


TGTCCTACCT 


CCCTGCCCTT 


PfTIPRIPPRimP It 

GTGTGCTTCA 


x it ****^** wm > it m^ M m 

•AAGGCATGCT 


xxx * m ** m *t* m >*\ nmp 

AAAGGGACTC 


12093 




GGAATGGwAU 


AGTGTTCCTT 


Pmpfnmpmppp 

CTGTTCTCCC 


ACCCCCAGAA 


****** IV **%**% f*\ X ********* 

GGAGGCAGGC 


12153 


GGATTGT A 


GATTO^TAGA 


AGTAAGTGGC 


CCTGAGCATG 


CTGTTGATGA 


» *~* f^m/"* /** it x pip» 

ACCTGGAACC 


12213 


it r******* it /i /~>< 

AGGCAGGCTG 


GGCATCCTAG 


GACCTGTCTT 


TCCATAGAAG 


TCTGAATCAG 


TCTACCTTTG 


1227-3 


GGACTGAGTA 


AGGGGCTCCT 


CACATATCAG 


CTGGCTAGTC 


CATCTTGGCT 


GATCTAAACC 


12333 


ACATTAGGCT 


GAAGAGAAGC 


ATGGTGTACA 


GTCTGGTCCA 


CCCGAACCAC 


ATACTGGCTT 


12393 


TATCAGTTCT 


CGTATAATTT 


TGCAGGTAAC 


TTTTTAGCTC 


TAAGCCTGTC 


TCCTCATCTG 


12453 


TGAAATCGGG 


TCCCTCATAT 


CCTGCCTAGA 


AGGGCTTTTG 


AAAAGATTAA 


TGAAGTAGTA 


12513 


TGCCGAGTGG 


TTGGGGTTCT 


CTCCTTGACT 


GGAGCAAGTC 


TCTAGGAGTA 


CTAAGGATAG 


12573 


CCTGCTGTGT 


GCAGCACCCC 


CAGGGACTGT 


GCCTGAGTAG 


GAGGGTACAG 


AGTCTTCATG 


12633 
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TGAATGGCCC 


TTCTGGTCTT 


GCCCCGAAGT 


TAGTGTTGAT 


GTCATAGAGT 


CTACAAACAT 


12693 


GCCTTTTGTC 


CTTCCTCAGA 


AGTCCAAGCC 


TTTCCTGGCA 


GACCAGACAT 


TCATCTCCAC 


12753 


TGAGCCTCTA 


TGTGAGACTG 


GCTCCTGGCC 


TGAGCTGTGT 


GGGCTGAGCT 


GGCGAATGGG 


lzolo 


AAAACTAGAC ACCTGGGCAC CTGGGTGGGG GCTCGGGACA GCAGTGTTTC 


AGTTGTAGGC 


lZo/o 


ACTGTGCCCC 


TGCCTGGAGC 


TTCTGACTGA 


AGGTTACCCT 


nrACAftftAAnp 


AGGTTCCCTA 


12933 


TAGACACTAA 


CATAGCTGGG TCAGAGTGCA AGGTGGGTGT 




TGACCCATTC 


12993 


AGTGCAAAGG 


CTGCTCTTCT 


GGGAGTGAGA 


GCTCTGACAG 




GCCGAGGGGT 


13053 


CTCAGAGCAA 


ACCTGCCTGG 


CCTCTCCCCA 


CTCTGATGGA 




TAAACAAGTG 


13113 


ACTGTCCACT 


TTGCCTCAAT 


TTCAACATCT 


GTAAGATAGA 




TGGTCTGAAA 


13173 


ATGGTTTTAA AGATTAGTTA GCTAATACAG GGAAAGTGCT 


CTGAuAGGTA 


CCTGGCACCT 


13233 


TACTCAACAA 


GTGGCTGGAG 


TGCCTGATTT 


CCTAAGGTCT 


GGACCTGTCC 


CTATGCTTCA 


13293 


AGTGCCCCTA 


CAGCCTTGGT 


CAGGCCCTTA 


GGTTCTCCCA 


CCC ACCGCT G 


GCCCCAGGAC 


13353 


CTAGACTGCT 


GGACCCTGAC 


CCCATTTTTC 


CTTTAAGCCA 


CCTCTGCGTC 


AACTCTAAAA 


13413 


GGCGGTGGAG 


TTGTTTATCT 


AGGCTGTGAG 


GTGTCAGAGA 


AAGbAUt* TCyVj 


GCCGCTTTGT 


13473 


TCCTGTGTGG 


GCTGGGGCCA 


CTCCAGGAAC 


TGAGAAACCC 


ACCCACCTTT 


TCAAAAACAG 


13533 


CCTCTTCTCA 


GAGTCTGGCA 


CCTCAGCTAG 


CCACCATGCT 


GTuGGAOOAu 


TCCCAGCATG 


13593 


CTCTGCCTTT 


GGTTTGTTTC 


CCAGGGGCCT 


CAGTGCCTTT 




CAGGCATCTT 


13653 


TAGTTCAAGG 


GGAAAGAGGA 


AATGAAGTGT 


ATTTGCTGGT 




CTGTCACTTG 


13713 


CATTCTCACA 


GAGGCTAAAG 


AAATTTGCTC 


TTTGTATCTT 




CTTTATGATC 


13773 


TTTTCCCATC 


TGTTGTATCC 


CAACTGCAGG 


GCCCCAGTTC 


TAfSA ATTAf^P 


CCCTCCCCCA 


13833 


TAGGAAGCCG 


ACTTATGCTA 


TAATGTGAAT 


GACAAGTATC 


LI 1 1 X 


TCCCACAGGC 


13893 


ATTTTAATTT 


TCAAAAGGGC 


ATTGCACAAC 


CGCAGAGACA 


CTAAGAAGAG 


AGGTTTGGTG 


13953 


ATCAGAGTTA 


CAGCCCCAGC 


CTCCCAGCTG 


GTGGCCCGGC 


TGGTGCAGGT 


GTGTCGAAAG 


14013 


CAGTAGTTTC 


TGCTTCAGTG 


AAACTTGAGG 


ATCCTTTATT 


TAGCCAGTTC 


AGGGGCGGAA 


14073 


TGGCCATGCG 


AAGTCTATGT 


GTCACAGGTG 


TCAGGCCCCC 


ATATCCTGCT 


GAGTCTAGAA 


14133 


TCAGCTACGT 


AGCAGTTTTG 


GGGGTATTGC 


CAGACTGGGA 


GTTTACATCC 


PACAAfSPfiAf; 


14193 


AATGGTGGGG 


TTCCTATACT 


GCTCCAGACA 


GGATCTTTCC 


CCCAAGTTTG 


TCAGCCACCT 


14253 


CTCTTCAAGT 


CCCTTGGCTC 


TGACCAGCAA 


GACGTATCCA 


AAAGAAACTG 


aggaggccct 


14313 


TCACTTCTTT 


TTAGGATAGT 


GTGGGGCCAG 


CATGGTGGGG 


GTTGGGAATG 


GCTTTCTGTC 


14373 
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TCTTCCATCA 


TCACAGGCTA 


CTTCCCAGAG 


ACACTTTGAT 


TCTGGGCATC 


TCCAGCAGTC 


14433 


ACCTGGCCCA 


CAATGCTTTG 


CTGCCCTTTG 


CTTCAGCCAC 


TGTATCTGGT 


TGTCCCTTGA 


14493 


AGGTGAGCCA 


GAGCTCCTAG 


GCAGAGAGCA 


TGTGCTATAC 


AAAGCCGTAG 


GCTGGGCCCT 


14553 


GGG AACCTTC 


inifi/l/VmMMiM« mm* 

TTGCTGTCAT 


CCTCCTGTCA 


AACCCCTATG 


GTATGGTAGC 


CCACATAAGG 


14613 


CTTGTGCAAA 


AAACAGGCCA 


AAACATAAGT 


TATCTTTTCA 


CTCTATCGGG 


TCTTCTCATT 


14673 


TTCCCATGGT 


ACGTTCGGCT 


GGCCAGGCCC 


AAAAGATTTG 


AAGAGAGGTG 


GCTGGCAAGT 


14733 


CTAGGGGAAT 


AGGTCTATCT 


GGTTCCCTCC 


AGGAGCAGTG 


CCTAGTGAGA 


GGCTGGGCTG 


14793 


GGCAGGGCAG 


GGCCCCTTGC 


mm* m^m jmm m ^ mm* M JAM JM 

TCCACATTGC 


CTGAAGTCCC 


^* mm* mmm mm* mm**** mm* *m 

GCCCTGCCCG 


TCCTGGCTGG 


14853 


GATCTGGCAG 


GTCTTCCAGC 


TCCACACCCG 


GCTCTCAGCT 


GAGCCTGCTC 


AGAGACTAGT 


14913 


CCTGGCATGT 


GGGTTGCAGG 


GCTGGTTCCA 


GCTCCACCAG 


GAGGTATGGG 


CGTCTGGGTA 


14973 


CTCATGGGAC 


ATTGACCTGT 


AGTGGGTATG 


GAGAGTGGAG 


GAATGGTACA 


GGCAGGTGTG 


15033 


CTGGTGCTGA 


CGGACTTGAC 


TCCGGCATTG 


ACCTTGGCTT 


GCAGTCTGGT 


GTTAAACTAA 


15093 


CAGGGAATGC 


TGACAAAAAA 


GACAGTTATT 


AAAACCAAGA 


CAGGATACTG 


CTTTCCCACT 


15153 


CAGCCCATTC 


CCAAGAATCC 


CCAAGACGTA 


CAGGAAATGT 


GCAACAGCAG 


TGGGAATTGC 


15213 


TGAGTTGGGG 


GATGTGGGTG 


AGCTGTGTGC 


TCCCAGGGAA 


TTTTGGGAAA 


TTCCCCTCCG 


15273 


TTGAAATGCT 


GTCAGGGTCT 


GAGCCTTGGA 


GGTGTTTTTG 


GGGTGCTGTG 


CTCCCCAGCT 


15333 


AAGCAGCTAA 


CAGTCCTCTT 


TACCTGCCTT 


GTCCTCACCT 


TGCCCCACCC 


TGGGTTGGGC 


15393 


CTCTCGTTCA 


CTCCCTGCTG 


GGTCACCAGT 


ACTTCAGTGC 


AGGTCTCAGC 


TTGATTCTTG 


15453 


GTGGAGAGAG 


AGAAAGTTGA 


TAAATCAGGG 


TGCCTGTCAG 


CCGGAAATTT 


GGGTGTGTCC 


15513 


TGAAGGCACC 


AATGGGGGCC 


CTCCCTTCTG 


GAGGTGGCTT 


T AGG AAGGGG 


TTTCTGGGTC 


15573 


TTGAGGCCTC 


CTTACAGTTT 


CTTAGCTCCA 


TGGGAGAGAA 


GTGAGGAGTT 


GGGTATCGTC 


15633 


ACCCCAGCAT 


GAATCTCTGG 


TCACCTCTCA 


GCATGCACTG 


TCCAGCCTGA 


TCTTTGAGTG 


15693 


CCATAAAAGA 


ACAGAATTAT 


CCTCTCAGAG 


CACTTCATTT 


CCCGCCAGCA 


CAGTGGGTAC 


15753 


AGAGACAAGC 


TGCCCAGACT 


CCCAGCGAGG 


GACTAGTTGA 


GCCCCAGCAT 


GGGACTAGTT 


15813 


GAGCTAGACC 


TGATACAGTC 


CCAGAGAGCC 


TCGTTGAGGA 


^ mm* mm* mmm smm m*m m*m « 

AGCTTTGGGA 


AAATTCACCC 


15873 


AGCATTTCAG 


CCAGGACTGG 


AGGAAAAGGT 


GATTATGGGA 


AAGAGAGCAG 


TCAAGACCCC 


15933 


AGGCTGTAGG 


ACACAGGATA 


CAAACTGAGA 


GCTACCGGAT 


AGGAGTAGGT 


TTTAGTCACA 


15993 


ATCTCTCCTG 


TCCGCCCTAC 


CCTCCAGGAG 


ACATTGCACC 


TTGTAGAACA 


GCTGCCCCGG 


16053 


AGTCCACCTT 


TGGGCCCCCC 


TGGGTAGCTC 


AGTAGTGTCA 


GCATCCTCTC 


ATTGACATCA 


16113 
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GTCAGGTTAC 


ACAGTGGGGC 


AGCTAATGTG AAGGCGCTAG GCTGGGAAGC CAGCTACTTG 


16173 


GGAAAACTAG 


GTTGTTCCTG 


GTAGGCCCTA GCAGGAAGGC AGTTCCTCCT TTTCTTGGTG 


16233 


GCTTTAGGGG 


TCTTTGGAAG 


CTTTGAATGT TCCCTCAGCT CGTTGGTGAA GCAGGCCCTC 




CTGGTACTGT 


GGTGTTTGTC TTCGAAGAGT GAAGGCATTG GAAGTAAAGA CTGATGGGGC 


1 coco 


GCCTTCCCAG 


GATGCTTTGC 


TTCTTGCGCT GGCTTACAGA GCTCTCTTGC TACCTAGTGC 


16413 


CTTGACTTTG 


AACACCAGAT 


TCAGTCAGGG AACAGGAGTA GAGGTCTTGC CTTGCTGAGC 


16473 


CCCTGCGCAC 


TGCAGGAAAA GACTCCTCTG AGTGGAGCCT TTCCTCCTCA GGTGACTGCT 


16533 


TTGAAAGTAC 


AGCAGCCTCT 


GAGGGGGAAG TGTCATTTGA CATTGTGGTA GTTCTTGGGG 


16593 


TCCCTGGATA 


CAGATGTCAT 


GCCCAGATCA TAGGTCTGTT TGTACAGAGG GAGGCGAGTT 


16653 


CTGTAGCTCA 


GAGTCCTCAG 


TACCCCAGAG TTGTGGCTCT AGGGGTGAGA GGAGAAGACT 


16713 


ACAGCCCTTC 


AATCACAGGT 


CTGACCTGTG GGTAGGGGTA GATCTCTTGC ATACTATiGAA 


16773 


CCTGTTTGAA 


ACCCCTGGGT 


ATTTGCTGTG GAATAGAGTC TTGGTTGGGT AAGAATGGTG 


16833 


GATGTTTATC 


TTGGTGTGAC 


TCTCGGGTGG GGGTGGGGGA TATGTCCCTG TCTTTCCCAA 


16893 


TGTAGTATGC 


TGAGTGGACA 


GAGACCGTGT GACTGAAGCC TGGGCTCCTG GAACAGGTGT 


16953 


GTGTTGGTGG 


GGGGTGGGGC 


GCAACTATCT GGGATCCAGA CTGCTTGGGA ATGGCTGTGA 


17013 


CCCAGCTCCT 


TTGATAACAG CAGCTCTTTG TCACTGGATG TTGTGACTAA TGGGACTTGT 


17073 


TGATTCAGTT 


ACTCGGCTCC 


CACCCACAGA CGCCGGGGCT TCTGTTGTGG CACCAGGCAG 


17133 


CTGCAGACGG 


CCCACAAGTT 


TGCCTCGCTT TCCCACTCCA CGAAGGTAAG TTCCCAGCAC 


17193 


TGCCCAAATT 


AGAGACTTGT 


GAGTGGTCCC CTCATACCCC ACTCCCTGAG GCTTCTCCTG 


17253 


GAAGGCCTGG 


AATGGGGCAC TGGGTGTGTA CGTGCTGTGG TTTCTGTTAG GGTCAAGACC 


17313 


AGGCTGTTTC 


TTACCTGGCT 


CGTACCTCCA AGTTTCCAGG TGATGAGTCC TGATTTTTGA 


17373 


AGTGAAGGAA 


TCCATTTAAT 


ATCAAAATTC TGTGACCTTA AATTTTTTTC TTTTATTATG 


17433 


TGTCATTTCA 


TATGTACGCA 


TATTTTTTTG TCTGTGTGTG GACATGCTTG TGGCGATCAG 


17493 


AGGACACTTC 


AGAAAGTCAG 


TTCTCTCCTG CCGTGTGGGT CCTGGGGAAT CAAATCCAAG 


17553 


TTGTCAGGCT 


TTATCCTGAA 


AATAAAAAGT AGACAGCCCT TGGGATCCAA AGCTTCTTAG 


17613 


GGCTGTGTGT 


CTTAGACACC 


ACCAGTGTTG CACAGCTGGT AACATGACAG TGTCCTGGAG 


17673 


TGCTGATTGG 


AAGCCACAGG 


CCTCTGTGCA GGGCGGTAGA CTTCCAGGGT ACGGGGCAGG 


17733 


TGGGCGTTCT 


CTACAAAAAC 


CTTGTAATCG CGGACGTCTT GGAGATGCCC CCTAGGTATC 


17793 


ATGATTTTGG 


TGTGTGACAC 


AGCTGAACTG TCTTCATACT CAGGATATCA TGAAGTGCTG 


17853 
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GGGTGCAGAC 


L AL ILl LAbL 


L X LAlLLALL 


P A PP A OC^HHC* 
LAubALLLbb 


PPPTPPATPA 
LLL X LLA X LA 


PATTPPPPTP 
\jr\ X X LLLw X w 


17913 


ACTACCACAG 


AGGGTLlllT 


tppi"pppppt 
TLL X j. LLLL X 


P APTPTPPPT 
LAL XL X LLL X 


PTPPP APPTP 
L X L LLALL X L 


PP APPA APTP 
LL ALL AAL X L 




lCTllALlCT 


TllxTTAALL 


A ILL XLX bLL 


PAPAPPPPPA 
LALAoLLLLA 


A APATPTTPT 


TTPPP A ATPP 
X X LLL An X LL 




L i X LLAAL X A 


par2T"ppii apt 
L ALL 1 L AAt I 


PTPTPPTPT A 


PTATPTPPTP 
Uinlul LL X Vjr 


APPPAPPPTP 


PTGPATPTAT 

w X wvH X w X f\ X 


1B093 


1 A 1 ALL X X X L 


APAPPPPTPA 
AUAUbUo 1 LA 


TPPPTAPPPA 


PPPPPPPTPP 


TAPfiPAfiftfiT 


TAAGPAPTPA 

X AnUUnvy X w/\ 


18153 


to p* fTi to fn rp to p* ^ 
XLX XLX IX 


TPPTPPTP A A 
X LL X oL X LAA 


PPTPTPPT A A 
X L X X AA 


APA AAPTPAP 


PPP ATTTTPP 

wvvA X X X X wL 


PTTPAPAPAP 


* 1B213 


A TPPTPTP & P 
A X w 1L1 L A L 


PP APPTPTTT 
LLALL Itlll 


PTPPPPPTPP 
L X UuuLU X Vjro 


GGAGPCCGGA 


AfiAAAGGATG 


TACTACAGTG 

X /\ w X AUAVJ X w 


18273 


A P TPP & p A P T 


TPTTPPPTPA 
lbl J. ubL X LA 


TPPPPTTPPT 
X LLLL X X Lo X 


APPTPPTTP A 


PPCTGC5GAAG 

www X OvJOrV/\\J 


TPPTPTTTPT 

X ww X w X X X w X 


18333 

J»Owww 


T ATPTPTT AP 
X SIX L XLX XAL 


AP ATPPTP AP 
AL A 1 oU X UnL 


PTPAPPAPPA 


PTPPAPPAAP 


TPPATPPPAP 
X wwr\ X wwwMw 


GTGTAGGAAT 

w X w X AwwAA X 


18393 


LAAAL 1 LAL X 


LLLLAAAA 1 1 


A TPTPP A PPP 


PT APPPA APT 
t L X ALLL AAL X 


PTP A PPPPTP 
L X LALLLL X L 


TPTPPPPTTT 
XLX LLLL XXX 


1 R453 

X> OvJW 


LTllTGGAlx 


GGGCCCCTGC 


p* toto C p»p* mp t p 
tl XLLLXLXL 


TPTPP APPPT 
XLX LLAbub X 


pmmmppmpm a 
L X X X LL X L X A 


LALLL LLALL 


1 OC1 Q 

XOjXO 


P'P* A fPOPPP A A 


ALT ALALLAL 


X UAAL X LL X L 


LXXXLXXX LA 


PPPTPTPTPP 
LLL X L X L X LL 


APTTPTPATP 
AL X X L X LA X L 


i n^7^ 

iOJ / O 


I A I LTALL x T 


p* TV m a A rn/'^p»mrn 

wATAATCLTT 


pmuppma A a a 
L X ALL X AAAA 


P A A PPPTPT A 
LAAbLL X L X A 


A PTTTPPPTT 
AL XXX LLL X X 


TTP A A ATPAP 
X X LAAAX LAL 


XODOO 


CCAGCTTCCA 


GGCTCGATAG 


fn Ti Tv tv tv m 
TAlGAAllAT 


p* Tv Tv A tv mpmmm 

GAAAA X L xTT 


pmm * p«pi tv mp A 

L X X ALLA X LA 


LL XXL XXX XL 


1 QAQQ 
I0O7O 


TAGTGTGTGT 


GTGTGTGTGT 


GTGTGTGTGT 


GTGTGTGTGT 


pfnPffiPmPfr a o 

GTGTGTGTAC 


/-»m AP*APAfnAm 

GTACACATAT 


Xo/Do 


GTACCTCTAT 


CAGTGTGCTG 


TGCGTGTACC 


iv/^iv/^»OTi/*^iv/**m 

ACAGCAGACT 


CGTGAGGAGG 


mn » ppp» tv tv O 

TCAGGCAAAC 


X00X0 


mmmjvrniv » jv » * 

TTTATAAAAA 


TCTTTTTTTT 


TTGCTTCACT 


TGAGTlllAG 


ppmp A P* A A P 1 

GGTlAlACAG 


mpPPR AP , ^T1P•P , 

TGllAAlTll 


1 PQ70 

loo /o 


TlAGlTwTGT 


TCTCTGTTCT 


TlATx TLTT X 


WPTIPTiPpTiPP 

XL X LALLALL 


X LA XLX XLX X 


a Rpppmmppp 
AALLL X X LLL 


X OzfOO 


flAf2f2PPA A AP 


pt a r , nnr >r vnn 

L 1 ALL oL 1 LL 


mmppm^ppmm 

til oU X X L X X 


LLLLonb X OU 


CCZCZTC* A A TPP 
LLL X L/iM X LL 


PTAPAPATPT 

w X AUALA X w X 


18QQ3 


PT A APPPPTP 


PPP AP APPTP 
u^UnU ALL X L 


PTPPA APPAP 


PPAPPTPAPA 


APTGATAPTG 
r\\3 x vjrv x r\ w x w 


AGATGPPPPT 

t\\Jt\ X wwwww X 


19053 


GTGAGCCCTC 


TCGAACACAC 


APPPTTPTAA 


ATAPTAPPTP 


ATTGTTACAT 

xTk X X w X X Avo X 


TGGAGAPTCG 

X WtfAOAV X ww 


19113 


TCAGCTGGGT 


GGAGTCCTGG 


TTPAPAPPPA 
X X L ALA LLLA 


PTTATTPPTP 
VJ X X A X X LL X L 


PPPPPAP ATT 
LLLLLALAX X 


TPTTPTPTTP 
XLX XLXLX XL 




TGGGGCTGAA 


GTCTCTTCCT 


TPPTT A PPTP 
X UL X X ALL X L 


TP ATPPTPTP 
X LA X LL X L X L 


A TP AT APPTP 
A X LA X ALL X L 


PPAPPTPAPA 
LLALL X L ALA 


1 Q9^^ 


GTGGAGGCGG i GGC AGTC AGG 


P APPTPPTTP 
uAbt X LL XXL 


XLX X XLLX X A 


PP APPPPTTP 
LLALLLLX XL 


PAPAPTTPPP 
LALALX X LLL 


1 <J90^ 

17670 


GTGTAGGGGT 


TGGCTCCCCC 


TTTPPPTPPP 
XXI LLL X LLL 


PTPPAPPTPP 
L X LLALL X LL 


X X X L X LL XXX 


PAPPAPAPAT 
LAL LAL ALA X 


17wJ<J 


TCGTTCTAGA 


AACTTGTTGC 


GTAAACAAGA 


TCACAAAGCG 


ATAAGCTTGA 


GCAAAACCCA 


19413 


GGGGAACAAA 


TTGCTTCCCT 


GTGAAGACCC 


AATCTTAGCT 


CTTAGAGAAG 


CCCTCCCTTT 


19473 


TGGAAATTGC 


TGACTTTCAG 


GGCTTCTCTG 


TGGAGGAAAG 


AGGCTAGCCG 


CCGTATGTTT 


19533 


GCCTGGATTC CAATAAATCT 


TTGCGGCCTT 


GGCTACCCCT 


TGTTGAACAA 


GGTCTGCACT 


19593 
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CCTAATGCGT 


GCCTCAGdTG 


GTCTGAGACC TCTACCCCAT CTCCAGCTTT TCCTTCCTAT 


19653 


GGAGGGAGTC 


AGTGGGTTAG 


GAGAGAATGG AGTTGAGTCC TGGAATGAGG AGGAAGCTAT 


iy /io 


GAACTCGGGG 


CCTGTTCCTG 


TCTGGTGGGT GCTCTTCTCC GCCGCTGAAG GAGGCAbUUb 


1 Q77^ 


CAGGGAAGAC TACCACAGGA 


ATCCGAGTAC CACCTGGAGC AGTGTATACA GGATGTGGGC 




TGATGTGTGG 


TAAGGGCATG 


ATGGGCTGAT GTGTGGTAAG GGCATGGGAT CTGATTGCTC 




TGTGGATGGG 


CCACAGGGAA 


ATTTTTGAGT GTCTACTGCA GTAGTTCTCA ACCTGTGGGT 


19953 


TGTGCGCCCC TTGGTGGGAG TTACATATTA GATATTTACA TTATGATTCA TAACTGTAGC 


20013 


AAA AT T AC AA 


TTGTGAAAGA 


ACCAAGAAAT CACCGCAGCA TGAGAACCTG TATTAAAbbU 


200/O 


TCACGGTGTT 


AGGAGGGTTG 


AGAGCCACTC ATCCTCTGGG TCTAGGCCAT GGU^buuiui 




AACTGCTCTC 


TGGAGTTAAG 


CCACAGTGAA CCAGCTGTCC TTGCAGATGG ACTTGTGGAG 


20193 


GCTCCAAACC 


TTTGTCCCAG 


GGGAGAAGAG CTTGCTTTTG CTTTGTACTT TTAAAGGAAG 


20253 


TTCAGTGGTC 


TTCGGGCCTT 


GTGGCTGCTG TGTGTGGAAG TGCCCCTGTA CAATAAGCTG 


20313 


TATAGATCGT 


GTACAACTGC 


AGTTTTCCTC CGTGGGTCCA CCAACCACTC CTGACTCCAC 


20373 


GGATGAGTGA 


GGCCAGTAGG 


GCTGTGTGTG GGTCCCTAGG CCAAGCATCC TGGACCACGA 


20433 


TGAGCCTCAG 


CTAGACCACT 


CTGGATCTTT AGCAGAGGCT CCTAGAGAGC TGGL-Tt»t?c 1 l 


20493 


CCTCCTGCCT 


TCTTTTCTCT 


TAAAACTTCG TCTCAATCGG AAGCTCCTCT GTGCAUOlUrA 


20553 


CCTCCAGGCC 


TGGGGGTCGC 


CACAAATCCC CTCATCACAA GACGAGCAC?L, ILbv-AibAbb 


20613 


GACACGACAC 


TTGTTACCTA 


CCAGGCTGTG oCjvj T 1 11 1 o 1 1 1 1 l?v* i lb niiuiniu 


20673 


TTTTGTTTTT 


TTACTTGTAC 


AGAAGTGTTG TGACATCAGA TGTCAGCTGT TAGTGCTGGC 


20733 


ACCATTTTAC 


AGGTAGGGAA 


CTGAGGCTGT AAGATGTGTA GTGACATCGC TAAGGCCACT 


20793 


CAGTTGGTGA 


GGCCTTACCA 


AGGTCAGGTC TTTGGAGCCT TTTGCTGAAC CATGTACTTC 


20853 


TATCTCTGTT 


TTGTTGAAAC 


AAAGTCTATA TGGCTCTGGC TAGCCTATAA CCCCATATGT 


20913 


AGACGAGGCT 


GACCTCGAAT 


ACACTGCAGT CTTTTATGTC TGCCTTCTGG GTGGCAGGAT 


20973 


TGAAGGCATG 


TGATTCCTCC 


TAACTGTACA CTTTAAAAAA AAAATCATTC TTTGTTCTGG 


21033 


TCTGTGCCAG 


GGCCTTGTAA 


GATGTTCTGT GCTGAGCTGG GCTATTTGGG TTAGTCTCAT 


21093 


TGCTGAGCAG 


GGCCCCTGTA 


TCTTCCTTCT CTGTCACTTG CTTACCTGGG TCTTCCTCCT 


21153 


GCACTAGCTA 


TCCTAGAACC 


AGTACTGAGA GCAACTATGG GCCCAACTCT GCCCCTTGCC 


212i3 


CAGCCTGCTT 


AGCTGGGGGC 


GGTGTTCCAC TTCCCTGCCC AAGTCCTGTG GGACTGTGTT 


21273 


TGTACTCCAC 


CACCTTCAGT 


TCCTTGGAGC TGGAGCAGGC CAGGCGGCTG CATTCCTGCA 


21333 
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m m /"* m m t*y 

GCTGCTGTTG 


CCAGGGAGAG 


CCCATCCCAT 


nm tv omm/"» tv nm 

TCACTTCAGT 


CTCCTTAATG 


TAUAAvjCLTI 


Ol OQQ 


Amo /"» tv iv mm tv 

GTCGAATTAG 


CTTCCACTGT 


r>r\r+r* tv tv r*r*r* tv 

CCCCAACCCA 


iv 0 tv om tv nonm 

AGAGTACCCT 




0-7V /^m TV TV OTV TV n 

WtU 1 AA0AA0 


01 / CO 
ZX4QO 


GCCAGGATAC 


AGTCCTTCCT 


pm/"*nom/"» tv rn tv 

GTGGCTGATA 


AGACAGGCCT 


rr\ f*r^r^ tv p tv tv no 


/"V»fpr»f*r^ tv 00 tv 


ZlDlO 


CACTGTGTGG 


GCAAAGCTGC 


mmo iv r*r^ tv c*r*r* 

TTCAGCACCG 


71 mnnomnnm'p 


UATVjUL-AAVjU 


X XV7l?l*lt XajU 


zio/o 


TTCTCACAGT 


TGAGACTTCT 


GTGCGCACAC 


CCACTGTCTA 


GCTCAGCTGG 


tv tv om/"* tv iiimiti 

ACACTGATTT 


01 coo 


mprnmm tv iv tv mo 

TCTTTAAATG 


m tv m x n x mm mm 

TATAGATTTT 


GGGGTGGGGT 


pfppnrpnT, tv tv <*» 

GTGCTGAAAv? 


U I UUL.AL* X 


X UUUULtAAGC* 


ZX070 


U X t?Ab 111 U A 


IjAU 1 A 1 OA 1 U 


AA 1 X va A 1 but 


mmmp tv rpPOpm 


A i tAUnwU 1 1 


P m f2 mm r*pr' a 0 

Llljl lUULAb 


91 

^1 / SO 






AfiAf2r*A m r , r ,m 








91 R13 


CTCCACCTGG 


CACCAGGCTA 


TGATGTTATG 


GCATTGAGGG 


TATGAGAAGG 


ACCAGGGGTT 


21873 


TCCCAGAGTT 


ACGCCCAGGC 


GCACAGGCAA 


TTGTTTCCTA 


CATGTGTGGC 


TGGAATGGTT 


21933 


GGGTGAGCCT 


TTTCAGCTGC 


CTACAATAGG 


AACCCAGGGA 


TACTGGGCAT 


TGACCAAGGC 


21993 


ATATCTCATA 


CCCTTTTCTT 


ATCTTTCTGC 


AG CAA ATT 
Gin lie 


GTG GCT GTA AAT GTT 
val Ala Val Asn Val 


22046 



25 

CCT CCT GAA GAT CAG GAT GGC TCT GGG GAT GAC TCT GAC AAC TTC TCT 22094 
Pro Pro Glu Asp Gin Asp Gly Ser Gly Asp Asp Ser Asp Asn Phe Ser 
30 35 40 45 

GGC TCT GGC ACA GGTAAGACTG ACCCAGAACA CTGAGATGGC ATAGATCATG 22146 
Gly Ser Gly Thr 



GCTGGAGTGG 


TGAGCAGGCA 


GTCACCCAGC 


TTTTAGTGAA 


CCCCCTTCTT 


CTCCCATCCC 


22206 


ATCCTTAGCC 


ATTGGAGTCA 


GGACAGTGCC 


AAAAGGAAGA 


ATGGTATCCA 


GCTGCAAGCC 


22266 


ACTCAGCTAA 


GAGAAACTCT 


CAGAGAAATG 


AAGGGGTTCC 


ACCAGGCCAT 


GGGCAGCCAC 


22326 


TAGAGCCAAC 


CCTTGGAGGA 


GTTTGACTCC 


ACTGAGCCTT 


GGTGTGGTGT 


TTCCATCTGT 


22386 


GAGATGGGAA 


TACTTTGCCC 


AAGAGCCTGT 


taGaagctgt 


AGGAAGCACA 


GAGTCGGCTA 


22446 


GGTATAGATT 


TGCTCTCACC 


TCCATCTCTC 


GATACCAGTT 


CTCTGCAGAG 


CTTGGGTTTG 


22506 


TGGGAGGGGT 


GGGGGGGTGA 


GGGGAGAAGG 


CTGTGAGCTG 


CAGCTAGCCA 


GAGGGGTCTC 


22566 


CCAGAAGAAT 


GGGGAGAGCT 


AAGAAGGAAA 


GTTGAGGTCA 


CAGTGGGAAG 


GAGACCAGAG 


22626 


CAAAGGGTTG 


GAAGGTAGGT 


AAAATGCAGC 


CGTGTATTCT 


TGGGAGCCTT 


AGGCCTTGGG 


22686 


CAAGAGGGTA 


GAAGAGGTGT 


TTGTCCTGGG 


CTGCAGTCCT 


GTATCAGCTC 


TGGTGTCTTG 


22746 


GCCCACGCTC 


ACAGCAGGAT 


CCCTTCCCAG 


ATTCCCGAGA 


ATTTCTCACA 


GTTCAGAGAG 


22806 


CACGCTACTT 


GTAGGCAGGT 


GAGGCTGCAA 


AGGACAGCTT 


TTCTGGCCTA 


ATTTTCAAAG 


22866 
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TGAGTTCAGC CTTTGCTAGG TCACCTTTGG GGTCTCAGAA GGCTTCAGCT CCTGGTAGAG 22926 
CATGAATCAC GTCAGGCGTG ATGCTGGAGA CCTCTCCTAC CCTGACACCC CAAACCCCCA 22986 

CCTCTGACCC TGCA GGT GCT TTG CCA GAT ACT TTG TCA CGG CAG ACA CCT 23036 
Gly Ala Leu Pro Asp Thr Leu Ser Arg Gin Thr Pro 
50 55 60 

TCC ACT TGG AAG GAC GTG TGG CTG TTG ACA GCC ACG CCC ACA GCT CCA 23084 
Ser Thr Trp Lys Asp Val Trp Leu Leu Thr Ala Thr Pro Thr Ala Pro 
65 70 75 

GAG CCC ACC AGC AGC AAC ACC GAG ACT GCT TTT ACC TCT GTC CTG CCA 23132 
Glu Pro Thr Ser Ser Asn Thr Glu Thr Ala Phe Thr Ser Val Leu Pro 
80 85 90 

GCC GGA GAG AAG CCC GAG GAG GGA GAG CCT GTG CTC CAT GTA GAA GCA 23180 
Ala Gly Glu Lys Pro Glu Glu Gly Glu Pro Val Leu His Val Glu Ala 
95 100 105 

GAG CCT GGC TTC ACT -GCT CGG GAC AAG GAA AAG GAG GTC ACC ACC AGG 23228 
Glu Pro Gly Phe Thr Ala Arg Asp Lys Glu Lys Glu Val Thr Thr Arg 
110 " 115 120 125 

CCC AGG GAG ACC GTG CAG CTC CCC ATC ACC CAA CGG GCC TCA ACA GTC 23276 
Pro Arg Glu Thr Val Gin Leu Pro lie Thr Gin Arg Ala Ser Thr Val 

130 135 140 

AGA GTC ACC ACA GCC CAG GCA GCT GTC ACA TCT CAT CCG CAC GGG GGC 23324 
Arg Val Thr Thr Ala Gin Ala Ala Val Thr Ser His Pro His Gly Gly 
145 150 155 

ATG CAA CCT GGC CTC CAT GAG ACC TCG GCT CCC ACA GCA CCT GGT CAA 23372 
Met Gin Pro Gly Leu His Glu Thr Ser Ala Pro Thr Ala Pro Gly Gin 
160 165 170 

CCT GAC CAT CAG CCT CCA CGT GTG GAG GGT GGC GGC ACT TCT GTC ATC 23420 
Pro Asp His Gin Pro Pro Arg Val Glu Gly Gly Gly Thr Ser Val lie 
175 180 185 

AAA GAG GTT GTC GAG GAT GGA ACT GCC AAT CAG CTT CCC GCA GGA GAG 23468 
Lys Glu Val Val Glu Asp Gly Thr Ala Asn Gin Leu Pro Ala Gly Glu 
190 195 200 205 



GGC TCT GGA GAA CAA GTGAGTGGCT TTGCATTTCC TGGGTGGCCA CTAGTGCCTG 23523 
Gly Ser Gly Glu Gin 

210 



CACCTGGCCG 


CCTAATGTCC 


TCATTACAGT 


GACAGGTGAC 


AGGGTCCCAC 


CTTCCTCCTG 


23583 


CCCGAAACAG 


ACTGATTGCA 


AGATCAGGAG 


GTGGGCGACT 


CCTTAGATGT 


CATTCAGGAG 


23643 


CTTACAGCAG 


GGTGAATTTT 


CCGTCTTAGA 


CCTTCATGGG 


AATTTTCACA 


CAACAATGTG 


23703 


TACGTTGTGT 


CACTGGAGGC 


GGTATCTGTG 


TCTTGGCCTG 


CCAGGGTCCC 


AGGTGTGACT 


23763 



GACTGCATTC CTTGACAGAT 



GCTGGTATAG GTTGGCTACG TCTGATGGGG GTGGCAGGGG 23823 
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ATCCCATCAG GTATGGCACT GCTCAGGTTG CTGTTGTGTC AGTGGCTCCA GCTGACCTGA 23883 

TCCCAACCTA CCCTTCTGTA G GAC TTC ACC TTT GAA ACA TCT GGG GAG AAC 23934 

Asp Phe Thr Phe Glu Thr Ser Gly Glu Asn 
215 220 

ACA GCT GTG GCT GCC GTA GAG CCC GGC CTG CGG AAT CAG CCC CCG GTG 23982 
Thr Ala Val Ala Ala Val Glu Pro Gly Leu Arg Asn Gin Pro Pro Val 
225 230 235 

GAC GAA GGA GCC ACA GGT GCT TCT CAG AGC CTT TTG GAC AGG AAG GAA 24030 
Asp Glu Gly Ala Thr Gly Ala Ser Gin Ser Leu Leu Asp Arg Lys Glu 
240 245 250 

GTG CTG GGA GGTGAGTCTT CTTTCAGGTG GAGAGGAGGA GGCAGGTGGT 24079 
Val Leu Gly 
255 



GGCTCTGAGG TAGCCTGGGT TGCTGGGGTG AAGCATCTTT AGCAGCAGGG TGGGGAAGGA 24139 

GGAGGGTCAA TTCTACTCCA GGCCACCTCC TAGGCTGTCC GTCTAGTCTG GGAGAGACTA 24199 

CCACTGACCC CGTGGAGCTA CTGATCTGAG CCTGCCTCTG TTCACTCCCT A GGT GTC 24256 

Gly Val 



ATT GCC GGA GGC CTA GTG GGC CTC ATC TTT GCT GTG TGC CTG GTG GCT 24304 
lie Ala Gly Gly Leu Val Gly Leu He Phe Ala Val Cys Leu Val Ala 
260 265 270 

TTC ATG CTG TAC CGG ATG AAG AAG AAG GAC GAA GGC AGC TAC TCC TTG 24352 
Phe Met Leu Tyr Arg Met Lys Lys Lys Asp Glu Gly Ser Tyr Ser Leu 
275 280 285 

GAG GAG CCC AAA CAA GCC AAT GGC GGT GCC TAC CAG AAA CCC ACC AAG 24400 
Glu Glu Pro Lys Gin Ala Asn Gly Gly Ala Tyr Gin Lys Pro Thr Lys 
290 295 300 305 

CAG GAG GAG TTC TAC GCC TGATGGGGAA ATAGTTCTTT CTCCCCCCAC 24448 
Gin Glu Glu Phe Tyr Ala 
310 



AGCCCCTGCC 


ACTCACTAGG 


CTCCCACTTG 


CCTCTTCTGT 


GAAAAACTTC 


AAGCCCTGGC 


24508 


CTCCCCACCA 


CTGGGTCATG 


TCCTCTGCAC. 


CCAGGCCCTT 


CCAGCTGTTC 


CTGCCCGAGC 


24568 


GGTCCCAGGG 


TGTGCTGGGA 


ACTGATTCCC 


CTCCTTTGAC 


TTCTGCCTAG 


AAGCTTGGGT 


24628 


GCAAAGGGTT 


TCTTGCATCT 


GATCTTTCTA 


CCACAACCAC 


ACCTGTTGTC 


CACTCTTCTG 


24688 


ACTTGGTTTC 


TCCAAATGGG 


AGGAGACCCA 


GCTCTGGACA 


GAAAGGGGAC 


CCGACTCTTT 


24748 


GGACCTAGAT 


GGCCTATTGC 


GGCTGGAGGA 


TCCTGAGGAC 


AGGAGAGGGG 


CTTCGGCTGA 


24808 


CCAGCCATAG 


CACTTACCCA 


TAGAGACCGC 


TAGGTTGGCC 


GTGCTGTGGT 


GGGGGATGGA 


24868 


GGCCTGAGCT 


CCTTGGAATC 


CACTTTTCAT 


TGTGGGGAGG 


TCTACTTTAG 


ACAACTTGGT 


24928 
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TTTGCACATA 
AAGGCAAGTC 
GAATCGGGGG 
AAATCTGGAC 
CGTACAACAG 
TGGCTAGTCC 
ACTCAGGGCC 
TACTCGAAGT 
TGGCAAGGGT 
GAAGTTGACA 
TCACCTTCAG 
GAGTCTTTTG 
CAGATGTAAT 
TCTCTGGACT 
GCTTGGCTTA 
GTTCCTCTGA 
TTTTGGAAGT 
GGTTTAAGTA 
TTGCAGGGCA 
ACCCCTTGCT 
GCTAGACAAA 
TGGATGTGAT 
TGGGACAGTG 
TCCATTTGGA 
TCCAACTCTA 
AAATAGCAAC 
TATCCAGGAC 
CTAGGTTGTT 
CT*TTTGTTCT 
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TTTTCTCTAA 
TGTTGACTGG 
TTAATAAGAC 
GGCAAAACGT 
GGTATGGACT 
AGGATACTGT 
ACGGCCTTGG 
TCACAGTCTA 
GCCCAAGCGT 
AGACACATCT 
CTCCTGTGGC 
CTTTTGGCAA 
AAACTTTAAT 
GGTGGTATAG 
AGGCTGCACG 
TTTATTTGGT 
TGGTAAGTTC 
ATTGGGACTC 
GACACCTGGG 
TACTTAGCTC 
AGATCTGTGC 
GTGGGGTGGG 
TCAAGCCTCA 
CAGCAGACAC 
TCTGGTTTTT 
AGTCAGTGGC 
TCCCGACTTT 
CCTTTCCCTG 
GTTTGCTTAT 



TTTCTCTGTT 
TGTCCCTCAC 
TTTTTTTGTT 
AGGCTTAGTT 
ATCTGTATGG 
GGAATAGCCA 
CCTGGGCCAC 
GGAGTGGAGG 
CTCCCACCTT 
TGAGTATGGC 
CCCGCCCCCA 
AACGCTACTT 
ATAAAGGAGT 
ATATAGCCAC 
CCTCCAATCC 
TTTGTTGTGT 
AGCCAAGGTT 
TAGCACATCT 
ACAGCCTTCC 
AGTGATGTTC 
CCTGTGTTTC 
GGTGGGGTCA 
GGCTGGCGCT 
TTTGGTGTAC 
GGAATGGGAG 
ATCCTTGCCC 
GCACCTGTTT 
GGGTCAGGAG 
TGGGCAATTC 



CAGAGCCCCA 
CTCGCTTCCC 
TTTTGTTTTT 
TGTGTGTTGT 
TGCCCCATTT 
CCTCTTGACC 
CGTGACATTG 
GGAGGAGACT 
TGGTACCATC 
TGGCACTGGT 
GGCTGGAGTC 
AATCCAATGG 
CCTATGAACT 
CCTTTGCCCA 
CCCAAAGGTA 
TGTGTTGTGT 
TTACAGGCCC 
TGACCTAGGG 
TCCCTCATGT 
CAGCTCCTGG 
ATCCCAGAGC 
TATCTGAGAC 
CATTCATATA 
TTGTGCAGTC 
CCTAACTGGC 
AGGCCCCAGG 
TCCCTCTGTG 
GATTCTGCTG 
TCAACCTTCA 



GCAGACCTTA 
TAATCTACAT 
GTTTTTAACC 
CTCTGAGTTT 
TTGGCGGCCC 
AGTCATGCCT 
GAAGAGCCTG 
GTAGAGTTTT 
TCTAGTCATC 
TCCTCCATCA 
AGAAATGTTT 
GTTCTGTACA 
CTACTGCTTC 
AACCCTGGTA 
GGATCCTGGC 
TTTTCTTTTG 
TGATGTCTGT 
TCACTAGAGC 
TTGCTGGGAC 
CTAGGCTGCT 
TTGTTGCCAG 
AGCCCTCAGC 
ATTGCAATAA 
TCTTTTTGGT 
CTGTGTTCTG 
GCAGGACTAT 
TGTAGCATCA 
ACTCTGAATG 
CTAGCAACAG 



TTACTGGGGT 
TCAGGAGACC 
TAGAAGAACC 
GTCGCTCATG 
GTAAGTAGGC 
GTGTGCATGG 
TGTGAGAACT 
GGGGGAGGGG 
CTTCCTCCCG 
AGAACCAAGT 
CCCAAAGAGT 
GTAGATTTTG 
TGCTTCTTCT 
GCTCGGGGAA 
TGGGTCCAGG 
GCTAAACTTC 
TCTTCTAAAT 
TAAGCTTGCT 
ACTGCTGAGC 
CAGCCACTCA 
ATCACATGGC 
TGAGGGCTTG 
ATGGTACGTG 
CTGGACCATG 
GCTTGGTACC 
GCTCTTGCCA 
TGAACTCCAG 
TCAGGATTTG 
TCTCATGTGT 



24988 

25048 

25108 

25168. 

25228 

25288 

25348 

25408 

25468 

25528 

25588 

25648 

25708 

25768 

25828 

25888 

25948 

26008 

26068 

26128 

26188 

26248 

26308 

26368 

26425 

26488 

26548 

26608 

26668 
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CAG6ATTACA AGTATTGCTT GCACATTGAG GG 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 311 amino acids 

( B ) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: .2: 

Met Arg Arg Ala Ala Leu Trp Leu Trp Leu Cys Ala Leu Ala Leu Arg 
1 5 10 15 

Leu Gin Pro Ala Leu Pro Gin lie Val Ala Val Asn Val Pro Pro Glu 
20 25 30 

Asp Gin Asp Gly Ser Gly Asp Asp Ser Asp Asn Phe Ser Gly Ser Gly 
35 40 45 

Thr Gly Ala Leu Pro Asp Thr Leu Ser Arg Gin Thr Pro Ser Thr Trp 
50 55 60 

Lys Asp Val Trp Leu Leu Thr Ala Thr Pro Thr Ala Pro Glu Pro Thr 
65 70 75 80 

Ser Ser Asn Thr Glu Thr Ala Phe Thr Ser Val Leu Pro Ala Gly Glu 

85 90 95 

Lys Pro Glu Glu Gly Glu Pro Val Leu His Val Glu Ala Glu Pro Gly 
100 105 110 

Phe Thr Ala Arg Asp Lys Glu Lys Glu Val Thr Thr Arg Pro Arg Glu 
115 120 125 

Thr Val Gin Leu Pro lie Thr Gin Arg Ala Ser Thr Val Arg Val Thr 
130 135 140 

Thr Ala Gin Ala Ala Val Thr Ser His Pro His Gly Gly Met Gin Pro 
145 150 155 160 

Gly Leu His Glu Thr Ser Ala Pro Thr Ala Pro Gly Gin Pro Asp His 

165 170 175 

Gin Pro Pro Arg Val Glu Gly Gly Gly Thr Ser Val He Lys Glu Val 
180 185 190 

Val Glu Asp Gly Thr Ala Asn Gin Leu Pro Ala Gly Glu Gly Ser Gly 
195 200 205 

Glu Gin Asp Phe Thr Phe Glu Thr Ser Gly Glu Asn Thr Ala Val Ala 
210 215 220 

Aia Val Glu Pro Gly Leu Arg Asn Gin Pro Pro Val Asp Glu Gly Ala 
225 230 235 240 
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Thr Gly Ala Ser Gin Ser Leu Leu Asp Arg Lys Glu Val Leu Gly Gly 
245 250 

Val lie Ala Gly Gly Leu Val Gly Leu lie Phe Ala Val Cys Leu Val 
260 265 270 

Ala Phe Met Leu Tyr Arg Met Lys Lys Lys Asp Glu Gly Ser Tyr Ser 
275 280 285 

Leu Glu Glu Pro Lys Gin Ala Asn Gly Gly Ala Tyr Gin Lys Pro Thr 
290 " 295 300 

Lys Gin Glu Glu Phe Tyr Ala 
305 310 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2196 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
TCTAGAACAC TTATTAAGAG CCAGGCACTG AAAAGTGCAG ACTCCCTCAT TTCATCCTGG 
CCGTGCTTAC AAGTAGTTTC CATGCTCTGG TAACCCTGTG CAGAGGGCAG CGTGGGAGGC 
GGGCCGCTTG GTGGACGGTC ATGGGGGCTC TGCATGGGTG GTTGCCCTTG CCTCAGAAGA 
ACTCCCTAAG TAAGAGCAAG TTAGCCTCCC TAACCCCTGG. TGGGTTGTTG CTTCTTTTCT 
CCTCTTGTTT CTGCCAAGAG AGGGTGGACC AAGAAGACCC CAGCCTACAG AACATGTGAT 
CCAAATAAAC TTCTTTTTAG TATAAATGTC CTAGCCTGTG ACGTTCTGGT AGACTAGCAC 
AAGATGGACC AAGACAACTC TCATCGAGAC TCTGAGGAAC GAACTGGCAT CACATGGGAA 
CAGGAAATGA AGCTTAGAGA GAGGTTCTGT GGCTTGTCCA ACATGGCTGT AGTTTAAATC 
CAGCTTGCCA CCAAAGCACA CACATTTCAC TGCTGTGCTG GGCCGGGCCT CAGATCCCAG 
GGGCTCCGGA GCTAGAAGGA CACGTGTATC AGCCATGGCT TCAGTTTATT GCTGTATACT 
CTGTGCTTCT GGCTCTCATG GAAAAGACAG ACATTGGGGT TCTTATAATC TCTCCCTCTC 
CCCTCCCCAC ACTCTATCCC CAAAGGAGGC ACCACTTCTG CAGGTAAATG TTATCTTCAA 
AGCGCTCACA TCGCAACCTT TGCCCACACC ATCTCATTAA AGGAATTGGC AGTGACTTTA 
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AGGTGAAAGA 


tv pnipppiponp 
AC X CGGTGGC 


m tv PPfnpmm tv rn 
TACGTGTTAT 


Tv rn tv A A rnrnrpp"'^ 
ATAAATTTGG 


Tv mpmpppmpm 
ATGTGGGTCT 


GAGAGGTGGA 




» ppjv 7v p , p > */" , a p* 
ALGAAGGCAC 


mr*f**f~* A rn a a rn 
X LLtA X ACAT 


P*P* A Prnp , rnp , rp A 

GCAGTCTGTA 


P^ A mpp KPHipp 

GATGGAG X GG 


P* A fTtP^ AfPP'P' HP 

GA X GATGGAG 


p TV T.PJI UPTiPTi 

GAAGAAGAGA 




1 lbl LA X 1 1 A 


n^PT , p , /^p»Tr?p , Ti 
X btUtL X GC X 


npmp A P'P'P'P" A 

GG X GAGCCCA 


pp a Tirnppptpp 
GGAA X GGG X G 


m a PP A prfipmo 

X AGG AG X G X G 


»PP»PPTAPPfTlpm 

XGXGAGGXGX 


you 


AGGGCCC 1 GC 


TTGTGTATGG 


AAA A f^f^r^tvtrn TV 

AAAACGCT T A 


GTGTTT X ATA 


GGTATTTTGT 


p tv p tv tv m tv r^mm 

CAGAAT ACT T 


1UZU 


m t\ TV PP A A Prnrp 
1 AAGG AAL 1 X 


P* TV P*P* TV TV A prnm 

G ACC AAAGT T 


a p* a r^r^f^ a prm 
ACAGGGAGGT 


fnapRPTipTimni 
T AG AC AGAT 1 


pmp a rpOf^rn tv rn 

G X GATGGT AT 


a a PPinPm 

ACTCACCTCT 


1 PVQ P\ 

lUoU 


fiTPTPTP apv* 


p , mp»p»rn a tv pnip 


pp A PPOipfp»prp 

GGACC X C X X X 


A P'rpprnppprnrp 
ALXCXCCCX X 


PTiPPPRPPPA 

GAGGGAGGGA 


pfppppupTinip 
GTGCCACATG 


1 1 Af\ 
X14U 




PPP ta tv mPTip 
GGCACA X G X C 


TPPTPPHifP'T' A 

X GG X GG X X X A 


PPTPPPl A P'P* 

CC X LLU AAGG 


P* A PPTP A A POl 

GAGG X GAAG X 


ppppTV TVP»P»rnA 

CCCCAAGGTA 




PJ r»rp jv f*P*p * p rr» 


X LX A X ILlbl 


A ATTPAP.AP2A 

AA X X LAO ALA 


PP.P A 71 a TP A & 
LLCAAA X LAA 


AC 1 LI 1ALAA 


mpmfnmppppTi 
X GT a X GGCCA 


1 oAn 




tv r*r* a tv tapoipp 
AbLAAAb X GG 


PPPTTa TiPTVPIP 

CCC X AAG AG X 


PRPPS A RPAP 

GAGGAAAGAG 


AG X GGG X GGG 


TTCACTGCCT 


1 OOP 

132U 


PTPTPR TV Tlpp 

vjlbl GAA X CC 


otnp*o TtPKmxP 

CTGCAGATAG 


fTX*fn/^fno tv mpm 

TCTCTCATCT 


TGGTGCCCTT 


CllACAGAGG 


PfnPPP^pppn 

CTGGGGCGGC 


1380 


AGGAGGGAGC 


CTGGACAGCT 


CAGACACTGG 


GTCATTGATG 


ACTGTTGTGT 


GGGATACCTG 


1440 


CCGGGGCGCA 


GGAGTGAGCC 


ATGCCACCCC 


% ^v v i™f™i y** m 

AGGAAGTGGT 


TCAGGGTGAC 


TCTTCTTGGC 


1500 


tv p tv nr«m^»^n 

ACACCTGGGA 


GGATGTAGCT 


GGTGCTGGCA 


CACCCACCGT 


CACGAGAGCT 


TCCTGTCCAA 


1560 


A P^P^mrnp a A P tv 
ACCTTCAACA 


AAGGCGGCTT 


CTTGAGACAG 


GCTAGACTGA 


AGTCACCAGC 


CTTGGGTGGG 


1620 


pvrip*p> tv 9v mp 

G X CCACT atg 


m tv tv /~* r**mfi tv phi 

TAACCTCAGT 


GCTCAGGAAC 


CCTTTCCCAT 


ACTGTCTGGA 


ACTATACTGT 


1680 


STPTTiPPTPP 

A X G X AGCTGG 


Pinmmpri tv OPP 

GTTTCCACGC 


ATGTGTGCCT 


pp tv r^nr* tv pmp 

GCACCCAGTC 


p tv mPm^t tv m/^m 

CATCTCATCT 


TCTATCTCCC 


1740 


T , r , P'p , p ,r P'pn*»p , P" 


CGCT rCCCCC 


G X GGGGAGTG 


mpp tv mpmp a rn 

TCCATGTGAT 


pirtrnppTiinppp 
CTTCCATCCG 


CACCTCTTCT 


1 00 pv 


LL X LLL X bLU 


prnppm tv tv 7v pm 
CTGCTAAACT 


/-1 Tv pppmn PPm 

CAGGGTAGCT 


GCATTCCGCT 


GGCCTTCCCC 


ATGTTCCAGG 


1 OCA 

1860 


HTTP 7A prrppf 


TTPTPUPPTaP 

TTCTCTGCAC 


CTGTCCTTTG 


mp tv apftiptvpp 

TGAAGTGACC 


AGAGGATTTC 


mp tv mppmpmp 

TGATCCTGTC 


1920 


XCTGTCGCTC 


mPTv iv ppittiptv 

TGAAGGGTCA 


GGAGTTCCTC 


CTGCCTGGAC 


iv iv iv ^^^^iv m^^^i 

AAAGCCATCC 


TGACGGACAT 


1980 


AAATAAAACA 


AACATCAAAC 


TCTATTCAAC 


CCCCTGGAAC 


CCGTGTGTGT 


TACTTACAGG 


2040 


GCAAAAGAAT 


GGAGCAGGGG 


ATGGGTTGTG 


GGGGGGGGGG 


GTGGCATCTG 


GGTTGTCTAC ' 


2100 


AGTTGTGCAT 


TAAGTTGTAA 


TTAAGATGTG 


CATTTCTCCA 


AATAAGGGAA 


AATTATTCTG 


2160 


GATTATTTGA 


GTGAAGCTGA 


AAGGTGATCA 


TCTAGA 






2196 
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WHAT f ft CI AIMED IS: 

1 A method of decreasing the growth of a malignant cell wherein said 
method comprises exposing said cell to efficacious levels of a compound that 
induces expression of syndecan in said cell. 

2. The method of claim 1, wherein said induced expression is obtained by 
affecting the enhancer element of the syndecan gene (SEQ ID No. 3). 

3. The method of claim 1, wherein said induced expression is obtained by 
affecting the suppressor element of the syndecan gene. 

4. The method of claim 1 ,2 or 3, wherein said cell is steroid-responsive. 

5. The method of claim 4, wherein said steroid is estrogen or androgen. 

6. The method of claim 4, wherein said cell is selected from the group 
consisting of a malignant breast cell, an endometrium cell and a prostate cell. 

7. The method of claim 6, wherein said method further comprises 
exposing said cell to an anti-steroid agent. 

8. The method of claim 7, wherein said agent Is toremifene or tamoxifen. 

9. The method of claim 1 ,2 or 3, wherein said cell is a mesenchymal cell. 

10. The method of claim 6 or claim 9, wherein said cell is a human cell. 

1 1 . The method of claim 1, wherein said cell is exposed to efficacious 
levels of a composition comprising growth factors of said cell. 

12. The method of claim 1 1 , wherein said growth factors are selected from 
the group consisting of bFGF; TGF-B; and bFGF and TGF-B. 

13. A method for treating a patient in need of treatment to decrease the 
growth of a malignant tumor in said patient, wherein said method comprises 
administering, to said patient, efficacious levels of a composition that induces 
expression of syndecan in the cells of said tumor. 
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14. A method of claim 13, wherein said composition affects the enhancer 
element of the syndecan gene. 

15. A method of claim 13, wherein said composition affects the suppressor 
element of the syndecan gene. 

16. The method of claim 13, 14 or 15, wherein said tumor is a tumor whose 
growth is stimulated by steroids. 

17. The method of claim 16, wherein said steroid is estrogen or androgen. 

18. The method of claim 17, wherein said tumor is selected from a tumor 
the group consisting of a breast, endometrium, prostate gland or mesenchymal 
tissue. 

19. The method of claim 13, wherein said composition comprises an anti- 
steroid agent. 

20. The method of claim 19, wherein said agent is toremlfene or tamoxifen. 

21 . The method of claim 13, wherein the efficacious agents in the 
composition comprise growth factors. 

22. The method of claim 21 , wherein said growth factor is selected from the 
group consisting of bFGF; TGF-B; and bFGF and TGF-B. 

23. The method of claim 13, wherein said patient is a human. 

24. A method of inducing differentiation of a cell to a more differentiated 
phenotype, wherein said method comprises exposing said cell to efficacious 
levels of a composition that induces expression of syndecan in said cell. 

25. The method of claim 24, wherein the composition affects the enhancer 
element of the syndecan gene. 

26. The method of claim 24, wherein the composition affects the 
suppressor element of the syndecan gene. 
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27. The method of claim 24, wherein the growth of said cell is stimulated by 
steroids. 

28. The method of claim 27, wherein said steroid is estrogen or androgen. 

29. The method of claim 24, wherein said cell is an epithelium cell. 

30. The method of claim 29, wherein said epithelium cell is an epidermal 
skin cell. 

31 . The method of claim 24, wherein said cell is a human cell. 

32. The method of claim 24, wherein said composition comprises an anti- 
steroid agent. 

33. The method of claim 24, wherein the efficacious agent in said 
composition is toremifene or tamoxifen. 

34. The method of claim 24, wherein the efficacious agent in said 
composition is a growth factor. 

35. The method of claim 34, wherein said growth factor is selected from the 
group consisting of bFGF; TGF-B; and bFGF and TGF-B. 

36. A method of stimulating hair growth In an epidermal cell of skin, 
wherein said method comprises stimulating expression of syndecan in said cell. 

37. The method of claim 36, wherein said stimulated expression is 
achieved by affecting the enhancer element of the syndecan gene. 

38. The method of claim 36, wherein said stimulated expression is 
achieved by affecting the suppressor element of the syndecan gene. 

39. The method of claim 36, wherein said cell is a mesenchymal cell. 



40. The method of claim 36, wherein said method comprises administering 
a composition comprising efficacious levels of a growth factor. 
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41 . The method of claim 40, wherein said growth factor is selected from the 
group consisting of bFGF; TGF-B; and bFQF and TGF-B. 
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Fig 3. 



tctagatatteaaactcac cagatggagtgatgtccacc cctattggtgggagtgacta 
15" gtctttcctctgtcttctga ctcagatgcttagctagctc tttaggacccaccctcacac ^caaataatactttattt 
XVA actetcttaatacctttaac ccagtggagttgacatgaga aattaactaccataatttat aatatttcatttcataaatg 
m *22£SE£££* aatagaaagg t ?tgagcatg atggcccagtggtaaaggcc ^SSSSSS 
1838 taacaaataataacgggcet gttcttcaggcttgagggaa gtttattgattgaggctaaa agcaacccaaaggctccact 
37S8 ScctaSglgaaglcltgg atgtgctcwccacactgca tgtccacctgtggtgtcagc acctgggaagctgaggatga 
3678 t?qagaS?caaggtcat?a gcLLtagtataggctagc tggggtacatgggtcacaaa aaagaaaaaaaaataagcac 
ill a??SaatcccajScttga cagaccaalgggggggggat tgctgtgagtttaagacagc ctggcctacaaagaaaaacc 

llll £2££2S»JL w-t.^u ^"""""tStafa SSSSSSESSS 

3438 acaaaagctctcgtctgtgg ttcctattccctccttctcc cccaggtcttctttaatgta tactttttgtttgcttatpt 
All gcttgtlttgga?tttggc? tttaaagacagggtctcact atgtagctccaactatttgg ^^SSSS 
3278 actaaccaaaqacttataga gatctacctaccactgcctc ccaagtgctgagactaaagg eatgtgacactttgcttggt 
3?98 ^Sa«aalattttaaaag aacattttgaacattaatag atgtatgtatatatatcaet ctatgtagtatatatgttag 
-HI aca«tttcac"gagttac atatttaclctcaaaataag ttttttgtttttttttcttc tttttaaatttattttattt 
-3038 tttttttatttat!t?atta ttatatgtaagtacactgta gctgtcttcagacanaccag a ^agggagtcagatcttgt 
-2958 tacggatggttgtgagcacc atgtggttgctgggattcga actctggaccttccgaagag cagtcgggtgctcttaecca 
-2878 ctgaJccatctcaccagccc cttaaatttatttttatctt atgtccattggtgttttgcc tgcatgtatgtgtaaaagtg 
-SJI tcagaaactgaagttacaga ctgttgtgagctaccattgt tgtgggtgctgggacttgaa c "^gtcctctggaagagc 
-2718 agtcattattcttaaccact gagecatctctctagccctc gttttttagttttttttttt gttttgttttgttttttgtfc 
-2638 tttttaagattttcttattt attatatgtaagtacactgt agctgtcttcagacactcca gaagagggcgccagatctcg 
-2558 ttatggatggttgtgagcac catigtggttgctgggaattg aactccagacctttggaaga gcagtcagtgctcttaactg 
-2478 ctgagccalctc?ccagccc cgttttttaggtttttgaag acagggtttcctgtgtagct ctagctgtccaggaactagc 
-2398 tctgtagaccaggttggcct caaatttagagatttgcctg tctctctgcctctcgagagc tgggattaaaagtgtgcagc 
-2318 ccaacaatctactcaaagta ggttttgaaaaagctttcca tattaggagttaactagctt catttcagaaatactgcatg 
-2238 gaattcaaatgtgggaccat tcatagctactttggttttc cttcagtgacaggcattcgg catgcctattagggaagtca 
-2158 aatggcctggagaagtcatc ctgggtgagagggctaatgc attttcagcttgacagacac tgtcaacctatgcagacagt 
-2078 ctgctccagctcagatgtca attgcatgcagacctgcagt cagacgctaagctccctacc tactctccatcagcttagat 
-1998 gtaaggggtgctggaacaaa ggctctctctctctctctct ctctctctetctctctctct ttcttagaattagtattcta 
-1918 ttttattttatgtaaattgg tacttcacttacatgtatgt ccgtgtgaggatgttgtatc ctctggtactggagttatag 
-1838 acagctgtaagtcgccatac aggtgctgggaattgaaccc tgatcctctggaagaatagt cagtgctcttaacccctgag 
-1758 ccatctctccaacctcttgc atattgaggacagggaggaa tcacaagccatgtagggtgc ctgggctctgaggtcaacag 
-1678 gaccatagcctcctttcttt atgtgcctttcttggggtct ccctataggagtcgtcttcg ttgcctctttactgtctcat 
-1598 tgatctgggctaaacttatg cagttggaaggaaagatcaa gctggtcatgtttaaaacat gaaacagccteateagttcc 
-1518 cttcctgttcccgtctcccc cccccctcccgcccccattt tgagaggacaggaaggtaaa ataccaaagtgtcctatttt 
-1438 cctccaaatatcaggctcaa aggactgaagagctgacttc agatcccaaagccactgtgt taggaggcacctgcttttta 
-1358 ggtcetaagcccccctgagc cttgctattgggtattcttt accaagaccctcaaggatct aggcaagaactgggcaggat 
-1278 ctgtatgtagcccatagtta gacctagggcagctgagacg ccaaaagggagagtttcctg aggacaaaagtgttcaaaca 
-1198 caactgggtgctggctgttg ggctactcgtggaggtgtgg tgtgtgtaaaggaggctgtt gaattcccagaaggctggtt 
-1118 ccacagtgtagagtetacac tggggacttcccgagacgct gagcctcagatctagcttct cagtccaggccagctgatgt 
-1038 ggggctgaggaacaaggatg gatgccatctatggccctgc cttgcaggtgcaaagggcct ttggcaccatctacagattg 
-958 agggcaagacagggctggtt cttcctccttgctctcgctg ctatctgcctcgcctgtagg ctctctgggctcctttttgg 
-878 accgacacgtctgaaggagc ttggaaactgtgaggtccag gccccatagagaatcatgaa ggaacaggaattcaactgga 
-798 gctccgcagctggtcaggcc tgcggtcacctggaaacaaa gaggccatttattttttcct ttggtcttggacaaggaaga 
-718 gaaggggctttctataaata gaaagacagcaaaaaagaaa ataataataataataataat aataataataataataaaaa 
-638 caataacaaagccagctctt ccagacagtgctcatgtctt taaaggtctttaaaggtctg gagttcccagcaattaagta 
-558 aaggaccaagacctcagggg tccectatcctcagcccgtg gggaggtgggaaccatacat egatccctcggtttatatat 
-478 agcctcatcgctgtggggct ccgaggttgcccccaaaatc ttgctcacctggaggacccc tgggtgtcctcgcccagagg 
-398 gcgctgcagcctcgcacgta gagaactaacatcgcccttc tccagggcagtgcccccgga ctccggaccaggacatagta 
-318 gcgagtgcacctgggtctcc gtcagctacgcatcaaggaa ggcgcgacgcgggaattaca gattgccggcactcaccagt 
-238 gctcaggggaggaaggtggg actcagacctgcaagagctg aagagtggggtgggcttcga tcctaggaggcgtggaaggg 
-158 ggtgtggctggatccctggg gggtggggcgctccaagggg cggggcaacccagggggcgg ggcccgaggggtggagattg 
-78 ggactacccaggcccgcgga gctgggggtgggcggctagt tttgcaactgcagagccttt gggtttattataaggcggAG 
3 CTCCGCGGGAGAGGTGCGGG CCAGAGGAGACAGAGCCTAA CGCAGAGGAAGGGACCTGGC AGTCGGGAGCTGACTCCAGC 
83 CGGCGAAACCTACAGCCCTC GCTCGAGAGAGCAGCGAGCT GGGCAGGAGCCTGGGACAGC AAAGCGCAGAGCAATCAGCA 
163 GAGCCGGCCCGGAGCTCCGT GCAACCGGCAACTCGGATCC ACGAAGCCCACCGAGCTCCC GCCGCCGGTCTGGGCAGCAT 

Me 
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243 GAGACGCGCGGCGCTCTGGC TCTGGCTCTGCGCGCTGGCG 
t ArgArgAl aAlaLeu TrpL e u TrpLeuCysAlaLeuAla 

323 gcagggctgggaggcggcgg gaagccgggactcgccactc 
403 agcggggacttcttcccgcg ctgcctggcggatcctggga 
483 acggtccgtggatgccagga ggctatcccagctcgtggtc 
563 cccaattaagtaaaacgact ccacaggggtctgagtcgcc 
643 tgctctctcccgttgctgtc ttgctggccatctcagcggc 
723 ccaaacgccttttccataga ccctaattcaaaccagactg 
803 cctttgcaccggggcaagtt tgggcacagcagagccggcg 
883 acgggattaaggctgtttgt gggacacaagagggtggctc 
963 gccgagctggcagcgggagg ttccgggaagttggcttcag 
1043 agttgtgctgcccccggagg gcctcctccgcatggcccgc 
1123 tcccctcccccgactcggct ttgtgctgaagccgcgcgta 
i203 gaaagagggacgaacgtgga gctggcgactggtgggggaa 
1283 ggtctctctaatcagcggct tggcgacaaagagcttggtc 
1363 tcgctagcccgccttcccaa cgggcgctttgttctcggcc 
1443 cttgacgtgggccgggtcct gggtcgccgccagtgtcagc 
1523 tgtgaagtcccgggtgtcct ttccactggcgctttgccca 
1603. tatcctcccggaagtggcag tctcccaccgccacatctgg 
1683 cacagggaaggagagttgag cgctggtagataccaaggtc 
1763 gttaatgcattgccttcgct ccttcccaggtggctggtgc 
1843 gtattgcccatgagttgggg ttgtgtgggggcgccaaatc 
1923 ctgtgcactcctggctttaa ggttcaagaaattgtttgag 
2003 tagttaccactgtggagaag aaagtggagctacttagcat 
2083 ctaggcccaacgtgaggcct cactggggcagtacagatgc 
2163 gttctcttccctggtgtctt tagagaggcagacattcctg 
2243 gacaactagaggtggcgttt tccggaacttcctgtgtgct 
2323 gtagaattatctcaatggaa ggcttcctggaggaagaaga 
240S gactcccacttggaggcaag agtgttcgttttactttttt 
2483 caaacaaacaaaaaaaaacc ccacattgtttaaaagtggg 
2563 tgaaaaatgaaggcagctgg gat t tact taaaatgagagc 
2643 ctccctaagctcctgtggga agcaggtcagacagccctgg 
2723 gaatggggaaggggggggga gctgctgggatcagaggctg 
2803 cagtgtcctgtggtcttctg tttctcctatatagaatagg 
2883 ttccctttctttcctctccc ttactgtaatcatttggact 
2963 gtgggggaggcctgtgcttg catgatgtcactgctggctt 
3043 tggctcttgatgggccacaa taaaacctcattaacacaca 
3123 tgatgtttttcctttttttt tttttttttttttggcagta 
3203 ggattcctgaagtcctttga gaatgtataatggtagccag 
3283 aagtatccttggttgttttt tgttgttgttgttgttgttg 
3363 ggctgccctggaactcaata tgtagaccaggctggcctca 
3443 gcctaaaggtgtaggccacc aaaagtgctcaacttttaca 
3523 cctttctgttatcttcaaca atacactgctaggtgtattt 
3603 caagcagtctcctggttttt ggtggctctgaagaagactg 
3683 tccttctcagtctttggcag gcagagtgacactggtgtgc 
3763 tttatcagactgaaaaacaa atcaggttggttataattct 
3843 gtgtgggtctgatgtccctc gggaatctctgttgaggctg 
3923 acagaagcgtgctgcccacc ccactgtcctgttcctaaat 
4003 cggtgggagatgcacacaat gccaggaagcacacaggagc 
4083 atcagctccagctgccctgg actaacaaacaaggctagct 
4163 gagactgatcctcactcggt agccttgaactcttagcaat 
4243 gagccaccatatgcgactga aaccttgttcctaatccttg 
4323 ttagtgtatgttttagttcg cgtcctacataatctattgc 
4403 agcagagcctcacactgaag ctcagtcctgccactgattt 
4483 ccccatctgtagagtagaca tcaccatacctgctctttct 
4563 gtgcaaaagtgctttgtaag taaagtgctggggaaatgtt 
4643 ctgttgtgtgtggggttggg tggggtttttttagaggctt 
4723 aagcctttatggtatgtatc attgcctgattgtttgagtg 
4803 tgtgcgtgctctcgtgctca catatgtatggtgtgaatac 
4883 ctgtgagccctgcctggagt ttgcagatcttccttggaca 
4963 tggctgtagcctacattgtg cctttgggtaaaccctgagt 
5043 gttgcttgggtacagctaag aactcttcatagaaagttga 
5123 gtcttgggtggtacctgctt gccctctcatgcagtttatc 
5203 tgtcagccctgggaggtggg tttcgtggccacaagtgggt 
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CTGCGCCTGCAGCCTGCCCT CCCGgtgagtgtggcccggg. 
LeuArgLeuGlnProAlaLe uPro 

gccgatgccatgcaggcggc agcacgtggagggggagggg 
tggtgagccctttaatgagg actcctgtcccaattcctct 
cgggcgtcctgcagagtgga acctccattggttccccgct 
ggccttaggcgctccgccgg ccttaggcgccgcttggagt 
ctggcctccgccagtgtccc ggaggatgcagtggccatgg 
caggctgcacccccagcgcc gcggagtccgggcgctcggc 
cgggaacagggggaagctga cgttcggggtggcgggaggg 
agggacttcggtttttctct ggctgccccaggtgagccgg 
aacgctgaagaccctaagaa cccaactttggggtcgctga 
gcgggggaccctccccgcga gtggaccccggtacggctct 
gggaaggcgggtcccttggc ccgcccagtagggccgcggg 
gcttctgggtaggatgcagc catccacctttggtggggtc 
gagggtaccccagaaagtgc tctcccgccccaagccgccg 
cctgtaacccttccctggga accgccccgcagcgctggtc 
gctgccctccggtgtccacg cccctagcccccgcacccgc 
acccctggaaggcagaggcg aggtgcggagcctcaggctt 
tctgcttaacttcgatagtc ctggcaaaggcagacacgtg 
gtgtacaaataaagtggcac acgacacgtccccagtcact 
tctccatcactctggagccc aagagagggcctccataatt 
agggttctctgggagggcta tgaattccgaactgagtctc 
ggttgtggtttttgtgggac tcagattatgcctggaatca 
gcctccccggcccgcctggc attacctccggctctgttct 
ag t ac tgaat 1 1 c 1 1 1 ccag ccaggat c tggagaggtggt 
tgacctaagcccctcaagca cccattaataatgctgagta 
ggcctgggaggttgaaccct ctaggaaacaggtctaggaa 
tgagctgagcccccaggtca ctgtctgagctttaggatca 
tttttaagtttagtttattt tctctctaacagaaaacaaa 
tgcataagagtgaggacata ttcagagcttccccttttcc 
acatatcacaattgccagag agctggtccctttctcaggg 
ggaccagagagatagggagt gcttttgggtgcctgccttt 
ctagcaactactccccagag actgaagcaggtttgtccct 
agaaatggttatttgctctg gaatagtgacttgctatttg 
agtagagacactttccccag gtctggcagaatgggaggga 
cagctctccagggagggtgg agttggttgtaacctacctg 
ttggtagggagaagggactg gaaagaatgatgggaaagat 
ctttctagatctcccctccc ccttgctgcagcaaaatttt 
actttttttttttcagtcag ctcaaaattgcctccttata 
ttttgttttgttttaagaca aggtttctctgtatagtcct 
aactcaaagaaatccaccta cttctaactttcagtgctgg 
aagcagtcttactttgagca ggattctgaaacccttattt 
agtccctcatgatgctgggc ctcctcaagtggcgccaggt 
tgtcccagtgactggcagtt tgaattcggagcttctcttt 
ccaagcctggagcttctctg tttaattctagtttattttc 
tataaacacgaaggtctcac ctttgcgtacgtctccggct 
ctgcagtgtgtgtgcgtgta gaaagggcaaggtagaatgg 
gatgaagcactggcccggtg aagagcctagagaactccct 
ttgagttccagcttggcagt gtcttctctttggtgacttt 
cactctcagtattgataatc gaaggtccttggttctgttt 
tctcctgtctcaactttcaa agagctgaaattacagactc 
actgtgaacgactcttgggt ttggttctttctccatttct 
ccatact tagaaacaacagg t tagagacagcat tgggtcc 
accgtgtcagctcaagtgac tcacttccaactcctctgct 
gcccacattetgtcattaac atgttcatttcataacgatg 
agctgtcgataatggttagg gttaactttttattgagtgc 
ggtagttttcttacttcttt cctacttagcttttcttcct 
tgtgcactgaggcacgcctg tgcatgtttgagagtatgct 
actgtagagtgcaggccggc acactggggctggctgaatc 
ctcctgcttgtgagcatttt gtgtggagtgactgtttagc 
attgggaaacaccctgggct gtggctgtgtgtgcccgacg 
gctcacatgctattagtatt aactgagtgctaaggaacct 
ttgagcttggcgaacacact tacagatttagtagagcttt 
agcttggaatccaagactcc tggcttctaggttgcattct 
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5283 cc tgCggtt c Ctt cca aggg "^'*"«"°'""' ^t^»ttte».t £££££££££ 
5363 ctgtttcctaattgcctttt ttttgttttcctctcaatet ^""gaagt^tgggcc g a giaccttgtgcece 
5443 ctgccctagtgtgtgcagag ggaagatagtggaacaggaa ?"^*^^!"^ta ggtgagaatgaacctittag 
5523 ggagcagagtcagcatggtg tcatatcgcttttggctatt ^^"^fjj™ ggecacttgtttgtetttt 
5603 aaccttctagaaccttctgt tgagtatgacaggaatgccc ^"tagggtccgaagcgca gg | | 

SS SSS5SS5SS 2=?~ SssSsK SSS5SS=5 
SS SSSSSSSS ™ E SSS SS3KSS3S 
SS SS5SS SSSSSSSS S SUA 2SS5SS3SC 

6163 tcccatctcctactgtttat ggc "tgtgggctacctaat gcttctgtcctgctggc?tc 
6243 agttctgcccagtgactcac ct 9*3 cct 9 Ca 9«^gtgt ^acacaggcgccc. ggg g 9 tccctccctt 
6323 agcctttctageccctggtg ttctcggcagtggtagcatc ^99***"^"^"" ctcctrtcttttgtcctage 
6403 /cttggtgtcttcccccttt ^^^SSSS £3£££S2££t ft^cagtcacaSctggfg 
6483 cttgtetctgacttcttgtg atectctgggcetgtgaaat «9""|^ tcattt tttctctttattccttctgg 

SS SSSSSSS SSSSLi-SU -Trrxsss 

S=bsssb 2SSSSS2S sssssss ssssssss- 

SSS SS5SS3S3 25=5=335 «25= 

sssssss sssssssss ssssssssss ssssss 

?JS cccttcatctSccclattt ggatctgtcgtatcagcatg tgctggccctgcttccatac ■ «S«2S3bS£ 
4cni rrrt noflaa ttcactaattq gggacttgagtgtatcagag gggcacaaagtaacattaac tctggtatcctctgcagcaa 
llll atcaaaaatcccctc?c«a gglgagt?c?cagtggatat ggaggtcaggtttgggcttg tagggccccagcaagagtcg • 
Vlll SgltglS^cageSct !!4aggaagatgagggtgc tgtgttgggatcacatctct ^^atgg^tgttgggg 
•7843 aaLatoaaacccttgcttc tgacccctaagcttggtctt taggtggccacagtctctgg gttctgtcctacctccctgc 
?!S tc«gtg?gcttcaaaggca t g ctaaagggactctcggcc attccgaatggcacagtgtt ccttctgttctcccaccecc 
SS agaaggaggcaggcctjgat tgtagattcetagaagtaag tggccctgagcatgctgttg atgaacctggaaccaggcag 
8083 o?tolaca?cc?aggacctg tltttccatagaagtctgaa tcagtctacctttgggactg agtaaggggctcctcacata 
SS IcalcCgctagtclatctl ggctgatctaaaccacatta ggctgaagagaagcatggtg tacagtctggtccacccgaa 
SS ccacaSctaactttatcag ?Ictcgtataattttgeagg taactttttagctctaagcc tgtetcetcatctgtgaaat 
SS cgggtccctcaSJcSg^c tagaagggctttt g aaaa g a ttaatgaagtagtatgccga I^^SSSSSS 
8403 gactggagcaagtctctagg agtactaaggatagcctgct gtgtgcagcacccccaggga ct 9^cctgagtaggagggt 
8483 acagagtcttcatgtgaatg gcccttctggtcttgccccg aagttagtgttgatgtcata jagtctacaaacatgccttt 
8563 tgtccttcctcagaagtcca agccttteetggcagaccag acattcatctccactgagcc tctatgtgagactggctcct 
SS ggcctgag^gtgtgggctg agctggcgaatgggaaaact agacacctgggcacctgggt S^ggctcgggacagcagtg 
8723 ttteagttgtaggcactgtg cccctgcctggagcttctga ctgaaggttaccctgagagg aagcaggttccctatagaca 
SS "aacSagctgggtcaga? tgcaaggtgggtgtgcccct gccctgacccattcagtgca aaggctgctcttctgggagt 
8883 gagagctc?gacaggactgt gatggccgaggggtctcaga gcaaacctgcctggcctctc ^f^ctgatggatatgtg 
89 53 ctcttaaacaaataactgtc cactttgcctcaatttcaac atctgtaagatagatagggc gttatggtctgaaaatggtt 
SS «aaagattagltag«aat acagggLagtgctctgaca ggtacctggcaccttactca acaagtggctggagtgcctg 
9123 atttcctaaggtctcgacct gtccctatgcttcaagtgcc cctacagccttggtcaggcc cttaggttctcccacccacc 
SS gctggccccaggaccLgac Igctggaccctgaccccatt tttcctttaagccacctctg cgtcaactctaaaaggcggt 
9283 ggag? t gttta?ctaggctg tgaggtgtcagagaaaggac ctgggccgctttgttcctgt gtgggctggggccaetccag 
9363 gaactgagaaacccacccac cttttcaaaaacagcctctt ctcagagtctggcacctcag ctagccaccatgctgtggga 
9443 ccactcccagcatgctctgc ctttggtttgtttcccaggg gcctcagtgccttttaaaga tgcacaggcatctt^agttc 
9523 aaggggaaagaggaaatgaa gtgtatttgctggtggtggt attcctgtcacttgcattct cacagaggctaaagaaattt 
9603 gclc^tgtatcttctagtc tcttctttatgatcttttec catctgttgtatcccaactg cagggccccagttctagaat 
9683 taacccctcccccataggaa gccgacttatgctataat g t gaatgacaagtatcctttag cccttcccacaggcatttta 
SS at?ttcaaaagggca«gca caaccgcagagacactaaga agagaggtttggtgatcaga ^ttacagccccagcctccca 
9843 gctggtggcccggctggcgc aggtgtgtcgaaagcagtag tttctgcttcagtgaaactt «W«te^UttWea 
9923 gtteaggggcggaatggcca tgcgaagtctatgtgtcaca ggtgtcaggcccccatatcc tgctgagtctagaatcagct 
iSS acgtagcagt^tgggggta «gccagactgggagtttac atcccagaagcgagaatggt sgggttcctatactgctcca 
10083 gacaggatcttccccccaag tttgtcagccacctctcttc aagtcccttggctctgacca 9 caa 9*=^ a ^« aaa 9« 
10163 actgaggaggcccttcactt ctttttaggatagtgtgggg ccagcatggt ggg ggttggg aatggctttctgtctcttcc 
10243 atcatcacaggctacttccc agagacactttgattctggg catctccagcagtcacctgg cccacaatgctttgctgccc 
10323 tttqcttcagccact g tacc t gg tt g tccctt g aa g gcga g cca g a g ctccta g gcagag agcatgtgctatacaaagcc 
10403 gtaggctgggccctgggaac cttcttgctgtcatcctcct gtcaaacccctatggtatgg tagcccacataaggcttgtg 
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10483 caaaaaacaggccaaaacat aagttatcttttcactctat cgggtcttctcattttccca tggtacgttcggctggccag 

10563 gcccaaaagacttgaagaga ggtggctggcaagtctaggg gaataggtctatctggttcc ctccaggagcagtgcctagt 

10643 gagaggctgggctgggcagg gcagggccccttgctccaca ttgcctgaagtcccgccctg cccgtcctggctgggatctg 

10723 gcaggtcttccagctccaca cccggctctcagctgagcct gctcagagactagtcctggc atgtgggttgcagggctggt 

10803 tccagctccaccaggaggta tgggcgtctgggtactcatg ggacattgacctgtagtggg tatggagagtggaggaatgg 

10883 tacaggcaggtgtgctggtg ctgacggacttgactccggc attgaccttggcttgcagtc tggtgttaaactaacaggga 

10963 atgctgacaaaaaagacagt tattaaaaccaagacaggat actgctttcccactcagccc attcccaagaatccccaaga 

11042 cgtacaggaaatgtgcaaca gcagtgggaattgctgagtt gggggatgtgggtgagctgt gtgctcccagggaattttgg 

11123 gaaattcccctccgttgaaa tgctgtcagggtctgagcct tggaggtgtttttggggtgc tgtgctccccagctaagcag 

11203 ctaacagtcctctttacctg ccttgtcctcaccttgcccc accctgggttgggcctctcg ttcactccctgctgggtcac 

11283 cagtacttcagtgcaggtct cagcttgattcttggtggag agagagaaagttgataaatc agggtgcctgtcagccggaa 

11363 atttgggtgtgtcctgaagg caccaatgggggccctccct tctggaggtggctttaggaa ggggtttctgggtcttgagg 

11443 cctccttacagtttcttagc tccatgggagagaagtgagg agttgggtatcgtcacccca gcatgaatctctggtcacct 

11523 ctcagcatgcactgtccagc ctgatctttgagtgccataa aagaacagaattatcctctc agagcacttcatttcccgcc 

11603 agcacagtgggtacagagac aagctgcccagactcccagc gagggactagttgagcccca gcatgggactagttgagcta 

11683 gacctgatacagtcccagag agcctcgttgaggaagcttt gggaaaattcacccagcatt tcagccaggactggaggaaa 

11763 aggtgattatgggaaagaga gcagtcaagaccccaggctg taggacacaggatacaaact gagagctaccggataggagt 

11843 aggttttagtcacaatctct cctgtccgccctaccctcca ggagacattgcaccttgtag aacagctgccccggagtcca 

11923 cctttgggcccccctgggta gctcagtagtgtcagcatcc tctcattgacatcagtcagg ttacacagtggggcagctaa 

12003 tgtgaaggcgctaggctggg aagccagctacttgggaaaa ctaggttgttcctggtaggc cctagcaggaaggcagttcc 

12083 tccttttcttggtggcttta ggggtctttggaagctttga atgttccctcagctcgttgg tgaagcaggccctcctggta 

12163 ctgtggtgtttgtcttcgaa gagtgaaggcattggaagta aagactgatggggcgccttc ccaggatgctttgcttcttg 

12243 cgctggcttacagagctctc ttgctacctagtgccttgac tttgaacaccagattcagtc agggaacaggagtagaggtc 

12323 ttgccttgctgagcccctgc gcactgcaggaaaagactcc tctgagtggagcctttcctc ctcaggtgactgctttcaaa 

12403 gtacagcagcctctgagggg gaagtgtcatttgacattgt ggtagttcttggggtccctg gatacagatgtcatgcccag 

12483 atcataggtctgtttgtaca gagggaggcgagttctgtag ctcagagtcctcagtacccc agagttgtggctctaggggt 

12563 gagaggagaagactacagcc cttcaatcacaggtctgacc tgtgggtaggggtagatctc ttgcatactatgaacctgtt 

12643 tgaaacccctgggtatttgc tgtggaatagagtcttggtt gggtaagaatggtggatgtt tatcttggtgtgactctcgg 

12723 gtgggggtgggggatatgtc cctgtctttcccaatgtagt atgctgagtggacagagacc gtgtgactgaagcctgggct 

12803 cctggaacaggtgtgtgttg gtggggggtggggcgcaact atctgggatccagactgctt gggaatggctgtgacccagc 

12883 tcctttgataacagcagctc tttgtcactggatgttgtga ctaatgggacttgttgattc agttactcggctcccaccca 

12963 cagacgccggggcttctgtt gtggcaccaggcagctgcag acggcccacaagtttgcctc gctttcccactccacgaagg 

13043 taagttcccagcactgccca aattagagacttgtgagtgg tcccctcataccccactccc tgaggcttctcctggaaggc 

13123 ctggaatggggcactgggtg tgtacgtgctgtggtttctg ttagggtcaagaccaggctg tttcttacctggctcgtacc 

13203 tccaagtttccaggtgatga gtcctgatttttgaagtgaa ggaatccatttaatatcaaa attctgtgaccttaaatttt 

13283 tttcttttattatgtgtcat ttcatatgtacgcatatttt tttgtctgtgtgtggacatg cttgtggcgatcagaggaca 

13363 cttcagaaagtcagttctct cctgccgtgtgggtcctggg gaatcaaatccaagttgtca ggctttatcctgaaaataaa 

13443 aagtagacagcccttgggat ccaaagcttcttagggctgt gtgtcttagacaccaccagt gttgcacagctggtaacatg 

13523 acagtgtcctggagtgctga ttggaagccacaggcctctg tgcagggcggtagacttcca gggtacggggcaggtgggcg 

13603 ttctctacaaaaaccttgta atcgcggacgtcttggagat gccccctaggtatcatgatt ttggtgtgtgacacagctga 

13683 actgtcttcatactcaggat atcatgaagtgctggggtgc agaccactctcagcctcagg cagccaggacccggggctcc 

13763 atcagattgcggtgactacc acagagggtggccttccttc cggtcagtgtgggtgtggga gctggcaggaagtggctcca 

13843 ggcttcctttaagcatcctc tgcccacagccccaaacatg ttctttggcaatggcttgca actagaggtgaactctctcc 

13923 tgtactatgtcctgacccac gctgctgcatctattatacc tttcacacgcgtgatgggta cccagcggggctgctaggca 

14003 gggttaagcactcatcttgt ttqctggtgctgaagctgtg gtaaagaaactgaggccatt ttcccttgagagagatggtc 

14083 tcagccaggtctttctcggc ctggggagcccggaagaaag gatgtactacagtgagtgga cacttgttggctgatggcct 

14163 tggtaggtccttcaccctgg gaagtgctgtttcttatctg ttagagatgctgacctcagc aggactggaggaactgcatg 

14243 ggaggtgtaggaatgaaagt gagtggggaaaattatctcc agccctagggaagtctgagg cctgtgtcccctttgtcctg 

14323 gactgggcccctgccttggg tgtctgtccagggtctttgc tctacagccccagcggatgc ccaaagtagacgagtcaact 

14403 ggtcctttctttcaccctgt gtccacttctcatgtatcta ccttcataatccttctaggt aaaacaagcctctaactttg 

14483 ggttttcaaatcagccagct tccaggctcgatagtacgaa ccatgaaaatctttcttacc atgaggttgttttctagtgt 

14563 gtgtgtgtgtgtgtgtgtgt gtgtgtgtgtgtgtgtgtgt gtacgtacacatatgtacct ccatcagtgtgctgtgcgtg 

14643 taccacagcagactcgtgag gaggtcaggcaaactttata aaaatcttttttttttgctt cacttgagtcccagggtcac 

14723 acagtggcaagtgctgagct ctgttctctgttcttgattt gttttgtgagcagctgatgt tcttaaggcttgcggagggg 

14803 aaaggtagggctggcttgct tcttccccgagtggcggtca atccctagacatctctaagc cgtggccacacgtcctggaa 

14883 ggacccaggtcagaagtgat actgagatggccctgtgagc cctctcgaacacacagggtt gtaaatagtacctgattgtt 

14963 acattggagactcgtcagct gggtggagtcctggttcaga gggagttattcctcccccca catttcttctcttctggggc 

15043 tgaagtctcttccttcctta cctgtgatgctgtcatgata ggtcccagctgagagtggag gcggggcagtcagggagctg 

15123 cttctctttgcttagcaggg gttggagacttggggtgtag gggttggctccccctttccc tgccctggacctggtttctg 

15203 gtttcagcagagattcgttc tagaaacttgttgcgtaaac aagatcacaaagcgataagc ttgagcaaaacccaggggaa 

15283 caaattgcttccctgtgaag acccaatcttagctcttaga gaagccctcccttttggaaa ttgctgactttcagggcttc 

15363 tctgtggaggaaagaggcta gccgccgtatgtttgcctgg attccaataaatctttgcgg ccttggctaccccttgttga 

15443 acaaggtctgcactcctaat gcgtgcctcaggtggtctga gacctctaccccatctccag cttttccttcctatggaggg 

15523 agtcagtgggttaggagaga atggagttgagtcctggaat gaggaggaagctatgaactc ggggcctgttcctgtctggt 

15603 gggtgctcttctccgccgct gaaggaggcagccgcaggga agactaccacaggaatccga gtaccacctggagcagtgta 
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15683 tacaggatgtgggctgatgt gtggtaagggcatgatgggc tgatgtgtggtaagggcatg ggatctgactgctctgtgga 
15763 tgggccacagggaaattttt gagtgtctactgcagtagtt ctcaacctgtgggttgtgcg ccccctggtgggagttacat 
15843 attagatatttacattatga ttcataactgtagcaaaatt acaattgtgaaagaaccaag aaatcaccgcagcatgagaa 
15923 cctgtattaaagggtcacgg tgttaggagggttrgagagcc actcatcctctgggtctagg ccatggcgggctgtaactgc 
16003 tctctggagttaagccacag tgaaccagctgtccttgcag. atggacttgtggaggctcca aacctttgtcccaggggaga 
16083 agagcttgcttttgctttgt acttttaaaggaagttcagt ggtcttcgggccttgtggct gctgtgtgtggaagtgcccc 
16163 tgtacaataagctgtataga tcgtgtacaactgcagtttt cctccgtgggtccaccaacc actcctgactccacggatga 
16243 gtgaggccagtagggctgtg tgtgggtccctaggccaagc atcctggaccacgatgagcc tcagctagaccactctggat 
16323 ctttagcagaggctcctaga gagctggctggcttcctcct gccttcttttctcttaaaac ttcgtctcaatcggaagctc 
16403 ctctgtgcacgtgacctcca ggcctgggggtcgccacaaa tcccctcatcacaagacgag cagctcgcatgagggacacg 
16483 acacttgttacctaccaggc tgtggggtttttgttggttg gttgttttgttttgttttgt ttttttacttgtacagaagt 
16563 gttgtgacatcagatgtcag ctgttagtgctggcaccatt ttacaggtagggaactgagg ctgtaagatgtgtagtgaca 
16643 tcgctaaggccactcagttg gtgaggccttaccaaggtca ggtctttggagccttttgct gaaccatgtacttctatctc 
16723 tgttttgttgaaacaaagtc tatatggctctggctagcct ataaccccatatgtagacga ggctgacctcgaatacactg 
16803 cagtcttttatgtctgcctt ctgggtggcaggattgaagg catgtgattcctcctaactg tacactttaaaaaaaaaatc 
16883 attctttgttctggtctgtg ccagggccttgtaagatgtt ctgtgctgagctgggctatt tgggttagtctcattgctga 
16963 gcagggcccctgtatcttcc ttctctgtcacttgcttacc tgggtcttcctcctgcacta gctatcctagaaccagtact 
17043 gagagcaactatgggcccaa ctctgccccttgcccagcct gcttagctgggggcggtgtt ccacttccctgcccaagtcc 
17123 tgtgggactgtgtttgtact ccaccaccttcagttccttg gagctggagcaggccaggcg gctgcattcctgcagctgct 
17203 gttgccagggagagcccatc ccattcacttcagtctcctt aatgtagaagccttgtcgaa ttagcctccactgtccccaa 
17283 cccaagagtaccctgtcctt tcttcactaagaaggccagg atacagtccttcctgtggct gataagacaggccttgggac 
173 63 aaggcctgggaccacactgt gtgggcaaagctgcttcagc accgatggctcctccatgcc aagcttggctctgcttctca 
17443 cagttgagacttctgtgcgc acacccactgtctagctcag ctggacactgattttcttta aatgtatagattttggggtg 
17523 gggtgtgctgaaagctccca ctgatgccccaagcctgagt ctcagagtatgatcaattga tggctttcatgggtatcaca 
17603 gcttctgttcccaggtcaga ctccctgaccagtcagagca tcctggggttagacaatgtc cccgtcacttgtgcctccac 
17683 ctggcaccaggctatgatgt tatggcattgagggtatgag aaggaccaggggtttcccag agttacgcccaggcgcacag 
177 63 gcaattgtttcctacatgtg tggctggaatggttgggtga gccttttcagctgcctacaa taggaacccagggatactgg 
17843 gcattgaccaaggcatatct catacccttttcttatcttt ctgcagCAAATTGTCGCTCT AAATGTTCCTCCTGAAGATC 

GlnlleValAlaVa lAsnValProProGluAspG 

18923 AGGATGGCTCTGGGGATGAC TCTGACAACTTCTCTGGCTC TGGCACAGgtaagactgacc cagaacactgagatggcata 
InAspGlySerGlyAspAsp SerAspAsnPheSerGlySe rGlyThrG 

18003 gatcatggctggagtggtga gcaggcagtcacccagcttt tagtgaacccccttcttctc ccatcccatccttagccatt 
18083 ggagtcaggacagtgccaaa aggaagaatggtatccagct gcaagccactcagctaagag aaactctcagagaaatgaag 
18163 gggttccaccaggccatggg cagccactagagccaaccct tggaggagtttgactccact gagccttggtgtggtgtttc 
18243 catctgtgagatgggaatac tttgcccaagagcctgttag aagctgtaggaagcacagag tcggctaggtatagatttgc 
18323 tctcacctccatctctcgat accagttctctgcagagctt gggtttgtgggaggggtggg ggggtgaggggagaaggctg 
18403 tgagctgcagctagccagag gggtctcccagaagaatggg gagagctaagaaggaaagtt gaggtcacagtgggaaggag 
18483 accagagcaaagggttggaa ggtaggtaaaatgcagccgt gtattcttgggagccttagg ccttgggcaagagggtagaa 
18563 gaggtgtttgtcctgggctg cagtcctgtatcagctctgg tgtcttggcccacgctcaca gcaggatcccttcccagatt 
18643 cccgagaatttctcacagtt cagagagcacgctacttgta ggcaggtgaggctgcaaagg acagcttttctggcctaatt 
18723 ttcaaagtgagttcagcctt tgctaggtcacctttggggt ctcagaaggcttcagctcct ggtagagcatgaatcacgtc 
18803 aggcgtgatgctggagacct ctcctaccctgacaccccaa acccccacctctgaccctgc agGTGCTTTGCCAGATACTT 

lyAlaLeuProAspThrL 

18883 TGTCACGGCAGACACCTTCC ACTTGGAAGGACGTGTGGCT GTTGACAGCCACGCCCACAG CTCCAGAGCCCACCAGCAGC 
euSerArgGlnThrProSer ThrTrpLysAspValTrpLe uLeuThrAlaThrProThrA laProGluProThrSerSer 

19963 AACACCGAGACTGCTTTTAC CTCTGTCCTGCCAGCCGGAG AGAAGCCCGAGGAGGGAGAG CCTGTGCTCCATGTAGAAGC 
AsnThrGlvThrAlaPheTh rSerValLeuProAlaGlyG luLysProGluGluGlyGlu ProValLeuHisValGluAl 

19043 AGAGCCTGGCTTCACTGCTC GGGACAAGGAAAAGGAGGTC ACCACCAGGCCCAGGGAGAC CGTGCAGCTCCCCATCACCC 
aGluProGlyPheThrAlaA rgAspLysGluLysGluVal ThrThrArgProArgGluTh rValGlnLeuProIteThrG 

19123 AACGGGCCTCAACAGTCAGA GTCACCACAGCCCAGGCAGC TGTCACATCTCATCCGCACG GGGGCATGCAACCTGGCCTC 
InArgAlaSerThrValArg ValThrThrAlaGlnAlaAl aValThrSerHisProHisG lyGlyMetGlnProGlyLeu 

19203 CATGAGACCTCGGCTCCCAC AGCACCTGGTCAACCTGACC ATCAGCCTCCACGTGTGGAG GGTGGCGGCACTTCTGTCAT 
HisGluThrSerAlaProTh rAlaProGlyGlnProAspH isGlnProProArgValGlu GlyGlyGlyThrSerValll 

19283 CAAAGAGGTTGTCGAGGATG GAACTGCCAATCAGCTTCCC GCAGGAGAGGGCTCTGGAGA ACAAgtgagtggctttgcat 
eLysGluValValGluAspG lyThrAlaAsnGlnLeuPro AlaGlyGluGlySerGlyGl uGln 

193 63 ttcctgggtggccactagtg cctgcacctggccgcctaat gtcctcattacagtgacagg tgacagggtcccaccttcct 

19443 cctgcccgaaacagactgat tgcaagatcaggaggtgggc gactccttagatgtcattca ggagcttacagcagggtgaa 

19523 ttttccgtcttagaccttca tgggaattttcacacaacaa tgtgtacgttgtgtcactgg aggcggtatctgtgtcttgg 

19603 cctgccagggtcccaggtgt gactgactgcattc.cttgac agatgctggtataggttggc tacgtctgatgggggtggca 
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19683 ggggatcccatcaggtatgg cactgctcaggttgccgttg tgtcagtggctccagctgac ctgatcccaacctacccttc 
19763 t g t agG ACTTC ACCTTTG AA ACATCTGGGGAGAACACAGC TGTGGCTGCCGTAGAGCCCG GCCTGCGGAATCAGCCCCCG 
AspPheThrPheGlu ThrSerGlyGluAsnThrAl aValAlaAlaValGluProG lyheuArgAsnGlnProPro 

19843 GTGGACGAAGGAGCCACAGG TGCTTCTC AG AGCCTTTTGG ACAGGAAGGAAGTGCTGGGA Ggtgagtcttctttcaggtg 
ValAspGluGlyAlaThirGl yAlaSerGlnSerLeuLeuA spArgLysGluValLeuGly G' 

20923 gagaggaggaggcaggtggt ggctctgaggtagcctgggt tgctggggtgaagcatcttt agcagcagggtggggaagga 
20003 ggagggtcaattctactcca ggccacctcctaggctgtcc gtctagtctgggagagacta ccactgaccccgtggagcta 
20083 ctgatctgagcctgcctctg ttcactccctagGTGTCATT GCCGGAGGCCTAGTGGGCCT CATCTTTGCTGTGTGCCTGG 

lyVallle AlaGlyGlyLeuValGlyLe uIlePheAlaValCysLeuV 

20163 TGGCTTTCATGCTCTACCGG ATGAAGAAGAAGGACGAAGG CAGCTACTCCTTGGAGGAGC CCAAACAAGCCAATGGCGGT 
alAlaPheMetLeuTyrArg Me tLysLysLysAspGl uGl ySerTyrSerLeuGluGluP roLysGlnAlaAsnGlyGly 

20243 GCCTACCAGAAACCCACCAA GCAGGAGGAGTTCTACGCCT GATGGGGAAATAGTTCTTTC TCCCCCCACAGCCCCTGCCA 
AlaTyrGlnhysProThrLy sGlnGluGluPheTyrAla 

20323 CTCACTAGGCTCCCACTTGC CTCTTCTGTGAAAAACTTCA AGCCCTGGCCTCCCCACCAC TGGGTCATGTCCTCTGCACC 
20403 CAGGCCCTTCCAGCTGTTCC TGCCCGAGCGGTCCCAGGGT GTGCTGGGAACTGATTCCCC TCCTTTGACTTCTGCCTAGA 
20483 AGCTTGGGTGCAAAGGGTTT CTTGCATCTGATCTTTCTAC CACAACCACACCTGTTGTCC ACTCTTCTGACTTGGTTTCT 
20563 CCAAATGGGAGGAGACCCAG CTCTCGACAGAAAGGGGACC CGACTCTTTGGACCTAGATG GCCTATTGCGGCTGGAGGAT 
20643 CCTGAGGACAGGAGAGGGGC TTCGGCTGACCAGCCATAGC ACTTACCCATAGAGACCGCT AGGTTGGCCGTGCTGTGGTG 
20723 GGGGATGGAGGCCTGAGCTC CTTGGAATCCACTTTTCATT GTGGGGAGGTCTACTTTAGA CAACTTGGTTTTGCACATAT 
20803 TTTCTCTAATTTCTCTGTTC AGAGCCCCAGCAGACCTTAT TACTGGGGTAAGGCAAGTCT GTTGACTGGTGTCCCTCACC 
20883 TCGCTTCCCTAATCTACATT CAGGAGACCGAATCGGGGGT TAATAAGACTTTTTTTGTTT TTTGTTTTTGTTTTTAACCT 
21963 AGAAGAACCAAATCTGGACG GCAAAACGTAGGCTTAGTTT GTGTGTTGTCTCTGAGTTTG TCGCTCATGCGTACAACAGG 
21043 GTATGGACTATCTGTATGGT GCCCCATTTTTGGCGGCCCG TAAGTAGGCTGGCTAGTCCA GGATACTGTGGAATAGCCAC 
21123 CTCTTGACCAGTCATGCCTG TGTGCATGGACTCAGGGCCA CGGCCTTGGCCTGGGCCACC GTGACATTGGAAGAGCCTGT 
21203 GTGAGAACTTACTCGAAGTT C AC AGTCTAGG AGTGG AGGG GAGGAGACTGTAGAGTTTTG GGGGAGGGGTGGC AAGGGTG . 
21283 • CCCAAGCGTCTCCCACCTTT GGTACCATCTCTAGTCATCC TTCCTCCCGGAAGTTGACAA GACACATCTTGAGTATGGCT 
21363 GGCACTGGTTCCTCCATCAA GAACCAAGTTCACCTTCAGC TCCTGTGGCCCCGCCCCCAG GCTGGAGTCAGAAATGTTTC 
21443 CCAAAGAGTCAGTCTTTTGC TTTTGGCAAAACGCTACTTA ATCCAATGGGTTCTGTACAG TAGATTTTGCAGATGTAATA 
21523 AACTTTAATATAAAGGAGTC CTATCAACTCTACTGCTTCT GCTTCTTC T TCTCTGGACTG GTGGTATAGATATAGCCACC 
21603 CTTTGCCCAAACCCTGGTAG CTCGGGGAAGCTTGGCTTAA GGCTGCACGCCTCCAATCCC CCAAAGGTAGGATCCTGGCT 
21683 GGGTCCAGGGTTCCTCTGAT TTATTTGGTTTTGTTGTGTT GTGTTGTGTTTTTCTTTTGG CTAAACTTCTTTTGGAAGTT 
21763 GGTAAGTTCAGCCAAGGTTT TACAGGCCCTGATGTCTGTT CTTCTAAATGGTTTAAGTAA TTGGGACTCTAGCACATCTT 
21843 GACCTAGGGTCACTAGAGCT AAG CTTGCTTTGC AGGGC AG ACACCTGGGACAGCCTTCCT CCCTCATGTTTGCTCGGACA 
22923 CTGCTGAGCACCCCTTGCTT ACTTAGCTCAGTGATGTTCC AGCTCCTGGCTAGGCTGCTC AGCCACTCAGCTAGACAAAA 
22003 GATCTGTGCCCTGTGTTTCA TCCCAGAGCTTGTTGCCAGA TCACATGGCTGGATGTGATG TGGGGTGGGGGTGGGGTCAT 
22083 ATCTGAGACAGCCCTCAGCT GAGGGCTTGTGGGACAGTGT CAAGCCTCAGGCTGGCGCTC ATTCATATAATTGCAATAAA 
22163 tggtacgtgtccatttggac agcagacactttggtgtact tgtgcagtctctttttggtc tggaccatgtccaactctat 
22243 ctggtttttggaatgggagc ctaactggcctgtgttctgg cttggtaccaaatagcaaca gtcagtggcatccttgccca 
22323 ggccccagggcaggactatg ctcttgccatatccaggact cccgactttgcacctgtttt . ccctctgtgtgtagcatcat 
22403 gaactccagctaggttgttc ctttccctggggtcaggagg attctgctgactctgaatgt caggatttgcttttgttctg 
22483 tttgcttattgggcaattct caaccttcactagcaacagt ctcatgtgtcaggattacaa gtattgcttgcacattgagg 
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